CONJUGATES FORMED FROM HEAT SHOCK PROTEINS AND OLIGO- OR
POLYSACCHARIDES



Field of the invention

The present invention relates to conjugated compounds
consisting of heat shock proteins and polysaccharides or
oligosaccharides, in particular those polysaccharides or
oligosaccharides derived from the capsule of pathogenic
microorganisms. Such compounds are capable of inducing the
formation of anti-polysaccharide antibodies and are
accordingly useful as vaccines for use in man and in
animals.

State of the art

Bacteria are the aetiological agents for a wide range of
disease conditions.

Examples of such diseases include meningitis caused by
Neisseria meningitidis and other infections caused by
Haemophilus influenzae Type b (Hib) or Streptococcus
(including Pneumococcus), typhoid fever caused by
infection with Salmonella typhi, and intestinal disease caused
by non-typhoidal Salmonella or Shigella bacteria.

It is known that protective immunity to capsular bacteria 
is mediated by antibodies to the capsular polysaccharides. 
It is also known that, in order to obtain sufficient
stimulation of the immune system, it is necessary to
conjugate capsular polysaccharides to carrier proteins
(Robbins et al., J. Infect. Dis., 1990, 161, 821-832).

In particular, there have been described in the literature
conjugated compounds consisting of polysaccharides (for
example Group C meningococcal polysaccharide (MenC), Hib
and Group A meningococcal polysaccharide (MenA)) and
proteins such as CRM-197 (a peptide derived from
Corynebacterium diphtheriae), TD (Diphtheria toxoid) or TT
(Tetanus toxoid) (see Peeters et al., Inf. Immun.,
(Oct. 1991), 3504-3510; Claesson et al., J. Pediatrics St
Louis, 112(5), 695-702, (May 1988)).

Some such vaccines are already used with good results in 
clinical practice. However, there exists the need to 
identify novel protein carriers which impart to the
conjugates immunogenic properties better than those
achieved with the carriers used hitherto.

The present invention relates to the use of heat shock
proteins as a protein carrier to increase the immunogenic
response of oligosaccharides and polysaccharides.

Heat shock proteins are known to contain a significant
number of T epitopes and thus to stimulate the cellular
immune system.

A conjugated compound of the heat shock protein of
Mycobacterium bovis (65 kDa), as a carrier for a malarial
epitope, has been described as inducing a marked immunity
in animals pre-immunised with Bacillus Calmette-Guerin
(BCG) without requiring adjuvants (Lussow et al., Eur. J.
Immunol., 1991, 21, 2297-2302). It is however to be noted
that the effects observed in Lussow et al. relate to T cell
dependent effects exhibited by peptides (which are well
known to be T-cell dependent) conjugated to heat shock
proteins.

More particularly, because the heat shock proteins are
well conserved across bacteria of different strains and
type, adventitious infection with bacteria, which is a
continuous process, will ensure that the immune system
remains sensitised to heat shock proteins, thus ensuring a
good response to the conjugate compounds of the invention
either at primary vaccination or on administration of a
booster vaccination.

The present invention permits bacterial
capsular polysaccharides and oligosaccharides to be used
without adjuvants (although adjuvants can be used).

Large numbers of children are given BCG vaccine (which
will include bacterial heat shock proteins) to guard
against tuberculosis and therefore a conjugate of the
present invention containing a heat shock protein as the
carrier would find a large number of subjects already
pre-immunised with the carrier precisely as a result of the
BCG vaccination which they have undergone.

Again, since the heat shock proteins are highly conserved,
even the population which has not been vaccinated with BCG
can easily develop immunity (as a result of natural
interaction with other bacteria) and can hence find itself
in a state of being able to develop a good immune response
following vaccination with a conjugate composed of a heat
shock protein and a T cell-independent antigen
(oligosaccharide or polysaccharide). Thus the carriers of
the present invention uniquely exploit the high
conservation of heat shock proteins across bacteria and
T-cell memory to ensure high titre vaccination.

Summary of the invention

According to the present invention, there is provided a
conjugate compound comprising at least one heat shock
protein or portion thereof including at least one
immunostimulatory domain and at least one oligosaccharide
or polysaccharide.

The heat shock protein may be any heat shock protein
capable of exhibiting an immunostimulatory effect in
animals, preferably humans.

The heat shock proteins are highly conserved in bacteria,
parasites and mammals. Any heat shock protein can be used
in the conjugates of the present invention, provided it
exhibits a positive immunostimulatory effect in the
intended immunisation subject without significant
deleterious effects. Specific, non-limiting examples
include heat shock proteins from Helicobacter pylori, P.
aeruginosa, C. trachomatis and M. leprae, especially the
hsp60 group of heat shock proteins.

More particularly, three heat shock proteins are
specifically exemplified herein, namely, M. bovis BCG
GroEL-type 65kDa hsp (hspR65), recombinant M. tuberculosis
DnaK-type 70kDa hsp (hspR70) and a novel heat shock
protein from H. pylori.

The H. pylori heat shock protein (hsp) is a protein whose
nucleotide and amino acid sequence is given in Fig. 3 and
whose molecular weight is in the range of 54-62 kDa,
preferably about 58-60 kDa. This hsp belongs to the group
of Gram negative bacteria heat shock proteins, hsp60. In
general, hsp are among the most conserved proteins in all
living organisms, both prokaryotic and eukaryotic,
animals and plants, and the conservation is spread along
the whole sequence.

The conjugate may contain one or more heat shock proteins
or immunostimulatory domains thereof. The heat shock
proteins may be the same or different. Preferably however,
one heat shock protein or a portion containing one or more
immunostimulatory domains is present.

As used herein, the term "immunostimulatory domain" refers
to a region of a heat shock protein amino acid sequence
capable of enhancing the immune reaction of a subject
mammal to a polysaccharide or oligosaccharide component of
a conjugate compound including the domain.

An advantage of using only specific domains from complete
heat shock proteins is that it is possible selectively not
to include domains common to human heat shock proteins.
For human vaccination this is advantageous as such regions
will not affect the immunostimulatory effect of the heat
shock protein as they will be recognized as "self". In
addition any immunity that is stimulated against such
"self" regions might lead to autoimmunity.

Suitable domains of the hsp60 family of heat shock protein
are identified in Figure 2 by underlining of sequence of
reduced homology with the human heat shock protein.
Functional sub domains within the domains shown in Figure
2 may also be used, as can domain and sub domain
combinations.

The skilled man can readily ascertain for a given heat
shock protein which domains or epitopes are responsible
for the immunostimulatory action and prepare modified heat
shock protein containing only those domains or a sub set
thereof.

The oligosaccharide or polysaccharide component of the
conjugate compound may be the complete capsular
polysaccharide or oligosaccharide of any pathogenic
microorganism against which vaccination is indicated or a
portion thereof capable of eliciting protective immunity.

The oligosaccharide or polysaccharide may be from a single
bacterium or from two or more bacteria.

Particular non-limiting examples of bacteria which may be
targeted include: Haemophilus influenzae Type b (Hib),
Streptococcus (including pneumococcus), Salmonella,
especially Salmonella typhi, and non-typhoidal Salmonella
or Shigella bacteria, which cause intestinal disease.

According to a particular embodiment of the invention,
there have been prepared conjugates consisting of
oligosaccharides of the Meningococci C (MenC) group and
hsp.

According to a further particular embodiment of the
present invention the hsp used for this purpose are hspR65
and hspR70.

In a second aspect of the invention, there is provided a
process for producing conjugate compounds according to the
present invention which comprises covalently bonding a
heat shock protein or portion thereof including at least
one immunostimulatory domain thereof to at least one
oligosaccharide or polysaccharide.

The oligosaccharide or polysaccharide is preferably
isolated from the bacterium to be targeted, but may be
produced synthetically.

The heat shock protein may be isolated from its naturally
occurring source or produced synthetically. Preferably
the heat shock protein is produced by recombinant DNA
technology using the techniques described in the general
methodology section of the description herein.

Preferably the oligosaccharide or polysaccharide is
modified prior to conjugation with the heat shock protein
or portion thereof to provide reactive sites for
conjugation. Suitably this involves introducing active
functional groups, such as amino groups, at the end groups
of the oligosaccharide or polysaccharide. The thus
modified oligosaccharide or polysaccharide may then be
activated using a linking group, such as succinimide, and
conjugated to the heat shock protein or portion thereof.

In a third aspect of the invention, there is provided a
conjugate compound according to the first aspect of the
invention for use as a pharmaceutical, preferably as
a vaccine.

In a fourth aspect of the invention, there is provided the
use of the conjugate compound according to the first
aspect of the invention in the manufacture of a medicament
for vaccination against bacterial infection.

In a fifth aspect of the invention, there is provided a
method of vaccination comprising administering an
immunologically effective amount of a conjugate compound
according to the first aspect of the invention.

In a sixth aspect of the invention, there is provided a
vaccine or therapeutic composition comprising one or more
conjugate compounds according to the first aspect of the
invention and a pharmaceutically acceptable carrier.

Preferably the composition is a vaccine composition and
may include other excipients such as adjuvants, as
necessary (see Section entitled "Vaccines" in the
description below).

In a seventh aspect of the invention, there is provided a
method for the preparation of a vaccine comprising
bringing one or more conjugate compounds of the first
aspect of the invention into association with a
pharmaceutically acceptable carrier and optionally an
adjuvant.

Brief Description of the Drawings 

Figure 1 shows the results of immunising mice with a
hspR65/MenC conjugate and comprises ELISA results for
anti-MenC in the blood. In Figure 1A the mice were
preimmunized with BCG and in Figure 1B they were not. The
Figures show the results with hspR65-MenC (o) conjugate or
hspR70-MenC conjugate, in PBS. Control groups of mice
were immunised with the MenC oligosaccharide alone (A) or
with a CRM197-MenC conjugate vaccine in aluminium
hydroxide (□).

Figure 2 shows the amino acid sequence of the Helicobacter
pylori heat shock protein and compares it with related
heat shock proteins from P. aeruginosa, C. trachomatis, M.
leprae and H. sapiens. The bars under the sequence
indicate domains of reduced homology between sequences 1
to 4 and the human heat shock protein.

Figure 3 is the nucleotide and amino acid sequences of the 
Helicobacter pylori heat shock protein. 

Detailed Description of Embodiments 

1. General Methodology

The practice of the present invention will employ, unless 
otherwise indicated, conventional techniques of molecular 
biology, microbiology, recombinant DNA, and immunology, 
which are within the skill of the art. Such techniques 
are explained fully in the literature. See e.g., 
Sambrook, et al., MOLECULAR CLONING: A LABORATORY MANUAL,
SECOND EDITION (1989); DNA CLONING, VOLUMES I AND II (D.N.
Glover ed. 1985); OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed.
1984); NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J.
Higgins eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D.
Hames & S.J. Higgins eds. 1984); ANIMAL CELL CULTURE (R.I.
Freshney ed. 1986); IMMOBILIZED CELLS AND ENZYMES (IRL
Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR
CLONING (1984); the series, METHODS IN ENZYMOLOGY
(Academic Press, Inc.); GENE TRANSFER VECTORS FOR
MAMMALIAN CELLS (J.H. Miller and M.P. Calos eds. 1987,
Cold Spring Harbor Laboratory), Methods in Enzymology Vol.
154 and Vol. 155 (Wu and Grossman, and Wu, eds.,
respectively), Mayer and Walker, eds. (1987),
IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY
(Academic Press, London), Scopes, (1987), PROTEIN
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL
IMMUNOLOGY, VOLUMES I-IV (D.M. Weir and C.C. Blackwell eds.
1986).

Standard abbreviations for nucleotides and amino acids are
used in this specification. All publications, patents, 
and patent applications cited herein are incorporated by 
reference. 

2. Definitions

Heat shock proteins that can be used in the present 
invention include those polypeptides mentioned above and 
polypeptides with minor amino acid variations from the 
natural amino acid sequence of the protein; in particular,
conservative amino acid replacements are contemplated. 

Conservative replacements are those that take place within
a family of amino acids that are related in their side
chains. Genetically encoded amino acids are generally
divided into four families: (1) acidic = aspartate,
glutamate; (2) basic = lysine, arginine, histidine;
(3) non-polar = alanine, valine, leucine, isoleucine,
proline, phenylalanine, methionine, tryptophan; and
(4) uncharged polar = glycine, asparagine, glutamine,
cysteine, serine, threonine, tyrosine. Phenylalanine,
tryptophan, and tyrosine are sometimes classified jointly
as aromatic amino acids. For example, it is reasonably
predictable that an isolated replacement of a leucine with
an isoleucine or valine, an aspartate with a glutamate, a
threonine with a serine, or a similar conservative
replacement of an amino acid with a structurally related
amino acid will not have a major effect on the biological
activity. Polypeptide molecules having substantially the
same amino acid sequence as the protein but possessing
minor amino acid substitutions that do not substantially
affect the functional aspects are within the definition of
the protein.
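
By way of illustration only, the four-family grouping described
above can be written as a lookup table. The following Python
sketch is not part of the original disclosure: the names
FAMILIES, family_of and is_conservative are hypothetical, and the
sketch assumes that a replacement counts as conservative simply
when both residues fall within the same listed family (the
optional aromatic grouping is ignored).

```python
# Illustrative sketch only; all names here are hypothetical.
# One-letter codes for the four families listed in the text.
FAMILIES = {
    "acidic": set("DE"),                # aspartate, glutamate
    "basic": set("KRH"),                # lysine, arginine, histidine
    "non-polar": set("AVLIPFMW"),       # alanine ... tryptophan
    "uncharged polar": set("GNQCSTY"),  # glycine ... tyrosine
}


def family_of(residue):
    """Return the family name for a one-letter amino acid code."""
    code = residue.upper()
    for name, members in FAMILIES.items():
        if code in members:
            return name
    raise ValueError("not a genetically encoded amino acid: %r" % residue)


def is_conservative(original, replacement):
    """Assumption: conservative means both residues share one family."""
    return family_of(original) == family_of(replacement)


if __name__ == "__main__":
    # Pairs named in the text (Leu/Ile, Asp/Glu, Thr/Ser) plus one
    # cross-family pair (Leu/Asp) for contrast.
    for pair in [("L", "I"), ("D", "E"), ("T", "S"), ("L", "D")]:
        print(pair, is_conservative(*pair))
```

Whether a given same-family replacement actually preserves
biological activity remains an experimental question; the table
only encodes the classification stated in the text.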

A significant advantage of producing the heat shock
protein by recombinant DNA techniques rather than by
isolating and purifying a protein from natural sources is
that equivalent quantities of the protein can be produced
by using less starting material than would be required for
isolating the protein from a natural source. Producing
the protein by recombinant techniques also permits the
protein to be isolated in the absence of some molecules
normally present in cells. Indeed, protein compositions
entirely free of any trace of human protein contaminants
can readily be produced because the only human protein
produced by the recombinant non-human host is the
recombinant protein at issue. Potential viral agents from
natural sources and viral components pathogenic to humans
are also avoided.

The term "recombinant polynucleotide" as used herein
intends a polynucleotide of genomic, cDNA, semisynthetic,
or synthetic origin which, by virtue of its origin or
manipulation: (1) is not associated with all or a portion
of a polynucleotide with which it is associated in nature,
(2) is linked to a polynucleotide other than that to which
it is linked in nature, or (3) does not occur in nature.

The term "polynucleotide" as used herein refers to a
polymeric form of nucleotides of any length, preferably
deoxyribonucleotides, and is used interchangeably herein
with the terms "oligonucleotide" and "oligomer." The term
refers only to the primary structure of the molecule.
Thus, this term includes double- and single-stranded DNA,
as well as antisense polynucleotides. It also includes
known types of modifications, for example, the presence of
labels which are known in the art, methylation, end
"caps," substitution of one or more of the naturally
occurring nucleotides with an analog, internucleotide
modifications such as, for example, replacement with
certain types of uncharged linkages (e.g., methyl
phosphonates, phosphotriesters, phosphoramidates,
carbamates, etc.) or charged linkages (e.g.,
phosphorothioates, phosphorodithioates, etc.), introduction of
pendant moieties, such as, for example, proteins
(including nucleases, toxins, antibodies, signal peptides,
poly-L-lysine, etc.), intercalators (e.g., acridine,
psoralen, etc.), chelators (e.g., metals, radioactive
species, boron, oxidative moieties, etc.), alkylators
(e.g., alpha anomeric nucleic acids, etc.).

By "genomic" is meant a collection or library of DNA
molecules which are derived from restriction fragments
that have been cloned in vectors. This may include all or
part of the genetic material of an organism.

By "cDNA" is meant a complementary DNA sequence that
hybridizes to a complementary strand of mRNA.

As used herein, the term "oligomer" refers to both primers
and probes and is used interchangeably herein with the
term "polynucleotide." The term oligomer does not connote
the size of the molecule. However, typically oligomers
are no greater than 1000 nucleotides, more typically are
no greater than 500 nucleotides, even more typically are
no greater than 250 nucleotides; they may be no greater
than 100 nucleotides, and may be no greater than 75
nucleotides, and also may be no greater than 50 nucleotides
in length.

The term "primer" as used herein refers to an oligomer 
which is capable of acting as a point of initiation of 
synthesis of a polynucleotide strand when used under 
appropriate conditions. The primer will be completely or 
substantially complementary to a region of the 
polynucleotide strand to be copied. Thus, under 
conditions conducive to hybridization, the primer will 
anneal to the complementary region of the analyte strand. 
Upon addition of suitable reactants, (e.g., a polymerase, 
nucleotide triphosphates, and the like), the primer will 
be extended by the polymerizing agent to form a copy of 
the analyte strand. The primer may be single-stranded or 
alternatively may be partially or fully double-stranded. 
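
The annealing step described above can be sketched in Python as
a search for the reverse complement of the primer on the strand
to be copied. This is an illustration only and not part of the
original disclosure: the function names are hypothetical, only
DNA bases A, C, G and T are handled, and only complete (not
"substantial") complementarity is modelled.

```python
from typing import List

# Illustrative sketch only; names are hypothetical.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def annealing_sites(primer: str, template: str) -> List[int]:
    """Return 0-based start positions on `template` (written 5'->3')
    where `primer` is perfectly complementary to the template."""
    site = reverse_complement(primer)
    template = template.upper()
    positions = []
    start = template.find(site)
    while start != -1:
        positions.append(start)
        start = template.find(site, start + 1)
    return positions


if __name__ == "__main__":
    template = "ATGGCCATTGTAATGGGCC"
    primer = reverse_complement("GTAATG")  # complementary to one region
    print(primer, annealing_sites(primer, template))
```

A primer that is only substantially complementary would need a
mismatch-tolerant comparison, which this sketch deliberately
leaves out.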

The terms "analyte polynucleotide" and "analyte strand" 
refer to a single- or double-stranded nucleic acid 
molecule which is suspected of containing a target 
sequence, and which may be present in a biological sample.

As used herein, the term "probe" refers to a structure
comprised of a polynucleotide which forms a hybrid
structure with a target sequence, due to complementarity
of at least one sequence in the probe with a sequence in
the target region. The polynucleotide regions of probes
may be composed of DNA, and/or RNA, and/or synthetic
nucleotide analogs. Included within probes are "capture
probes" and "label probes".

As used herein, the term "target region" refers to a
region of the nucleic acid which is to be amplified and/or
detected. The term "target sequence" refers to a sequence
with which a probe or primer will form a stable hybrid
under desired conditions.

The term "capture probe" as used herein refers to a
polynucleotide probe comprised of a single-stranded
polynucleotide coupled to a binding partner. The
single-stranded polynucleotide is comprised of a targeting
polynucleotide sequence, which is complementary to a
target sequence in a target region to be detected in the
analyte polynucleotide. This complementary region is of
sufficient length and complementarity to the target
sequence to afford a duplex of stability which is
sufficient to immobilize the analyte polynucleotide to a
solid surface (via the binding partners). The binding
partner is specific for a second binding partner; the
second binding partner can be bound to the surface of a
solid support, or may be linked indirectly via other
structures or binding partners to a solid support.

The term "targeting polynucleotide sequence" as used
herein refers to a polynucleotide sequence which is
comprised of nucleotides which are complementary to a
target nucleotide sequence; the sequence is of sufficient
length and complementarity with the target sequence to
form a duplex which has sufficient stability for the
purpose intended.

The term "binding partner" as used herein refers to a
molecule capable of binding a ligand molecule with high
specificity, as for example an antigen and an antibody
specific therefor. In general, the specific binding
partners must bind with sufficient affinity to immobilize the
analyte copy/complementary strand duplex (in the case of
capture probes) under the isolation conditions. Specific
binding partners are known in the art, and include, for
example, biotin and avidin or streptavidin, IgG and
protein A, the numerous known receptor-ligand couples, and
complementary polynucleotide strands. In the case of
complementary polynucleotide binding partners, the
partners are normally at least about 15 bases in length,
and may be at least 40 bases in length; in addition, they
have a content of Gs and Cs of at least about 40% and as
much as about 60%. The polynucleotides may be composed of
DNA, RNA, or synthetic nucleotide analogs.
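
The length and G/C guidelines stated above for complementary
polynucleotide binding partners can be checked with a few lines
of Python. The sketch below is illustrative only and not part of
the original disclosure: the function names and default
thresholds are hypothetical parameters chosen to mirror the
approximate figures in the text (about 15 bases, about 40% to
about 60% G plus C), and "about" is treated as a hard cut-off.

```python
# Illustrative sketch only; names and defaults are hypothetical.
def gc_fraction(seq):
    """Fraction of bases in `seq` that are G or C (0.0 to 1.0)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def meets_binding_partner_guideline(seq, min_length=15,
                                    gc_min=0.40, gc_max=0.60):
    """True if `seq` satisfies the length and G/C content guidelines."""
    return len(seq) >= min_length and gc_min <= gc_fraction(seq) <= gc_max


if __name__ == "__main__":
    for candidate in ["ATGCATGCATGCATGC", "ATATATATATATATAT", "GCGCATAT"]:
        print(candidate, len(candidate),
              gc_fraction(candidate),
              meets_binding_partner_guideline(candidate))
```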

The term "coupled" as used herein refers to attachment by 
covalent bonds or by strong non-covalent interactions 
(e.g., hydrophobic interactions, hydrogen bonds, etc.). 
Covalent bonds may be, for example, ester, ether,
phosphoester, amide, peptide, imide, carbon-sulfur bonds,
carbon-phosphorus bonds, and the like.

The term "support" refers to any solid or semi-solid 
surface to which a desired binding partner may be
anchored. Suitable supports include glass, plastic, 
metal, polymer gels, and the like, and may take the form 
of beads, wells, dipsticks, membranes, and the like. 

The term "label" as used herein refers to any atom or
moiety which can be used to provide a detectable
(preferably quantifiable) signal, and which can be
attached to a polynucleotide or polypeptide.

As used herein, the term "label probe" refers to a
polynucleotide probe which is comprised of a targeting
polynucleotide sequence which is complementary to a target
sequence to be detected in the analyte polynucleotide.
This complementary region is of sufficient length and
complementarity to the target sequence to afford a duplex
comprised of the "label probe" and the "target sequence"
to be detected by the label. The label probe is coupled
to a label either directly, or indirectly via a set of
ligand molecules with high specificity for each other,
including multimers.

The term "multimer," as used herein, refers to linear or
branched polymers of the same repeating single-stranded
polynucleotide unit or different single-stranded
polynucleotide units. At least one of the units has a
sequence, length, and composition that permits it to
hybridize specifically to a first single-stranded
nucleotide sequence of interest, typically an analyte or a
polynucleotide probe (e.g., a label probe) bound to an
analyte. In order to achieve such specificity and
stability, this unit will normally be at least about 15
nucleotides in length, typically no more than about 50
nucleotides in length, and preferably about 30 nucleotides
in length; moreover, the content of Gs and Cs will
normally be at least about 40%, and at most about 60%. In
addition to such unit(s), the multimer includes a
multiplicity of units that are capable of hybridizing
specifically and stably to a second single-stranded
nucleotide of interest, typically a labelled
polynucleotide or another multimer. These units are
generally about the same size and composition as the
multimers discussed above. When a multimer is designed to
be hybridized to another multimer, the first and second
oligonucleotide units are heterogeneous (different), and
do not hybridize with each other under the conditions of
the selected assay. Thus, multimers may be label probes,
or may be ligands which couple the label to the probe.

A "replicon" is any genetic element, e.g., a plasmid, a
chromosome, a virus, a cosmid, etc. that behaves as an
autonomous unit of polynucleotide replication within a
cell; i.e., capable of replication under its own control.
This may include selectable markers.

"PCR" refers to the technique of polymerase chain reaction 
as described in Saiki, et al., Nature 324:163 (1986); and 
Scharf et al., Science (1986) 233:1076-1078; and U.S.
4,683,195; and U.S. 4,683,202.

As used herein, x is "heterologous" with respect to y if x 
is not naturally associated with y in the identical 
manner; i.e., x is not associated with y in nature or x is
not associated with y in the same manner as is found in 
nature. 

"Homology" refers to the degree of similarity between x 
and y. The correspondence between the sequence from one
form to another can be determined by techniques known in
the art. For example, it can be determined by a direct
comparison of the sequence information of the
polynucleotide. Alternatively, homology can be determined
by hybridization of the polynucleotides under conditions
which form stable duplexes between homologous regions (for
example, those which would be used prior to S1 digestion),
followed by digestion with single-stranded specific
nuclease(s), followed by size determination of the
digested fragments.
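
The "direct comparison of the sequence information" mentioned
above can be illustrated as a position-by-position percent
identity. The Python sketch below is not part of the original
disclosure: the function name is hypothetical, and it assumes
the two sequences have already been aligned to equal length,
with "-" marking a gap that is counted as a mismatch.

```python
# Illustrative sketch only; the function name is hypothetical.
def percent_identity(x, y):
    """Percent of aligned positions at which x and y carry the same
    residue. Assumes x and y are pre-aligned and of equal length."""
    if len(x) != len(y):
        raise ValueError("sequences must be aligned to the same length")
    if not x:
        raise ValueError("empty sequences")
    matches = sum(
        1 for a, b in zip(x.upper(), y.upper()) if a == b and a != "-"
    )
    return 100.0 * matches / len(x)


if __name__ == "__main__":
    # Two made-up 8-base sequences differing at one position.
    print(percent_identity("ACGTACGT", "ACGTTCGT"))
```

Producing the alignment itself (for example for the sequences
compared in Figure 2) is a separate step that this sketch does
not attempt.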

A "vector" is a replicon in which another polynucleotide 
segment is attached, so as to bring about the replication 
and/or expression of the attached segment. 

"Control sequence" refers to polynucleotide sequences
which are necessary to effect the expression of coding
sequences to which they are ligated. The nature of such
control sequences differs depending upon the host
organism; in prokaryotes, such control sequences generally
include promoter, ribosomal binding site, and
transcription termination sequence; in eukaryotes,
generally, such control sequences include promoters and
transcription termination sequence. The term "control
sequences" is intended to include, at a minimum, all
components whose presence is necessary for expression, and
may also include additional components whose presence is
advantageous, for example, leader sequences and fusion
partner sequences.

"Operably linked" refers to a juxtaposition wherein the
components so described are in a relationship permitting
them to function in their intended manner. A control
sequence "operably linked" to a coding sequence is ligated
in such a way that expression of the coding sequence is
achieved under conditions compatible with the control
sequences.

An "open reading frame" (ORF) is a region of a
polynucleotide sequence which encodes a polypeptide; this
region may represent a portion of a coding sequence or a
total coding sequence.

A "coding sequence" is a polynucleotide sequence which is
translated into a polypeptide, usually via mRNA, when
placed under the control of appropriate regulatory
sequences. The boundaries of the coding sequence are
determined by a translation start codon at the 5'-terminus
and a translation stop codon at the 3'-terminus. A coding
sequence can include, but is not limited to, cDNA, and
recombinant polynucleotide sequences.
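
The boundary rule just described (a start codon at the 5' end
and an in-frame stop codon at the 3' end) can be sketched in
Python. This example is not part of the original disclosure: the
function name is hypothetical, and it assumes the standard
genetic code, with ATG as the only start codon and TAA, TAG and
TGA as stop codons, read on the given strand only.

```python
# Illustrative sketch only; assumes the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_coding_sequence(dna):
    """Return the first stretch running from an ATG start codon to the
    next in-frame stop codon (inclusive), or None if there is none."""
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Walk codon by codon in the reading frame set by this ATG.
        for i in range(start + 3, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                return dna[start:i + 3]
        start = dna.find("ATG", start + 1)
    return None


if __name__ == "__main__":
    print(find_coding_sequence("CCATGGCTTGATAAGG"))
```

Real coding sequences may use alternative start codons or lie on
the complementary strand; those cases are outside this sketch.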

As used herein, the term "polypeptide" refers to a polymer
of amino acids and does not refer to a specific length of
the product; thus, peptides, oligopeptides, and proteins
are included within the definition of polypeptide. This
term also does not refer to or exclude post expression
modifications of the polypeptide, for example,
glycosylations, acetylations, phosphorylations and the
like. Included within the definition are, for example,
polypeptides containing one or more analogs of an amino
acid (including, for example, unnatural amino acids,
etc.), polypeptides with substituted linkages, as well as
other modifications known in the art, both naturally
occurring and non-naturally occurring.

A polypeptide or amino acid sequence "derived from" a 
designated nucleic acid sequence refers to a polypeptide 
having an amino acid sequence identical to that of a 
polypeptide encoded in the sequence, or a portion thereof 
wherein the portion consists of at least 3-5 amino acids, 
and more preferably at least 8-10 amino acids, and even 
more preferably at least 11-15 amino acids, or which is
immunologically identifiable with a polypeptide encoded in 
the sequence. This terminology also includes a 
polypeptide expressed from a designated nucleic acid 
sequence. 

"Immunogenic" refers to the ability of a polypeptide to 
cause a humoral and/or cellular immune response, whether
alone or when linked to a carrier, in the presence or
absence of an adjuvant. "Neutralization" refers to an
immune response that blocks the infectivity, either
partially or fully, of an infectious agent. "Epitope"
refers to an antigenic determinant of a peptide,
polypeptide, or protein; an epitope can comprise 3 or more
amino acids in a spatial conformation unique to the
epitope. Generally, an epitope consists of at least 5
such amino acids and, more usually, consists of at least
8-10 such amino acids. Methods of determining spatial
conformation of amino acids are known in the art and
include, for example, x-ray crystallography and
2-dimensional nuclear magnetic resonance. Antibodies that
recognize the same epitope can be identified in a simple 
immunoassay showing the ability of one antibody to block 
the binding of another antibody to a target antigen. 

"Treatment," as used herein, refers to prophylaxis and/or
therapy (i.e., the modulation of any disease symptoms).
An "individual" indicates an animal that is susceptible to
infection by a bacterium possessing an antigenic capsular
polysaccharide or oligosaccharide structure and includes,
but is not limited to, primates, including humans. A
"vaccine" is a composition, useful for treatment of an
individual, that is immunogenic or otherwise capable of
eliciting protection, whether partial or complete, against
such a bacterium.


The conjugate compounds of the invention may be used for 
producing antibodies, either monoclonal or polyclonal, 
specific to the proteins. The methods for producing these 
antibodies are known in the art. 


"Recombinant host cells", "host cells," "cells," "cell 
cultures," and other such terms denote, for example,
microorganisms, insect cells, and mammalian cells, that
can be, or have been, used as recipients for recombinant
vector or other transfer DNA, and include the progeny of
the original cell which has been transformed. It is
understood that the progeny of a single parental cell may
not necessarily be completely identical in morphology or
in genomic or total DNA complement to the original parent,
due to natural, accidental, or deliberate mutation.

Examples of mammalian host cells include Chinese hamster
ovary (CHO) and monkey kidney (COS) cells.

Specifically, as used herein, "cell line," refers to a
population of cells capable of continuous or prolonged
growth and division in vitro. Often, cell lines are
clonal populations derived from a single progenitor cell.
It is further known in the art that spontaneous or induced
changes can occur in karyotype during storage or transfer
of such clonal populations. Therefore, cells derived from
the cell line referred to may not be precisely identical
to the ancestral cells or cultures, and the cell line
referred to includes such variants. The term "cell lines"
also includes immortalized cells. Preferably, cell lines
include nonhybrid cell lines or hybridomas to only two
cell types.

As used herein, the term "microorganism" includes 
prokaryotic and eukaryotic microbial species such as 
bacteria and fungi, the latter including yeast and 
filamentous fungi.

"Transformation", as used herein, refers to the insertion 
of an exogenous polynucleotide into a host cell,
irrespective of the method used for the insertion, for
example, direct uptake, transduction, f-mating or
electroporation. The exogenous polynucleotide may be
maintained as a non-integrated vector, for example, a
plasmid, or alternatively, may be integrated into the host
genome.

By "purified" and "isolated" is meant, when referring to a 
polypeptide or nucleotide sequence, that the indicated
molecule is present in the substantial absence of other
biological macromolecules of the same type. The term
"purified" as used herein preferably means at least 75% by
weight, more preferably at least 85% by weight, more
preferably still at least 95% by weight, and most
preferably at least 98% by weight, of biological
macromolecules of the same type present (but water,
buffers, and other small molecules, especially molecules
having a molecular weight of less than 1000, can be
present).
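
The weight-percent thresholds in the definition of "purified" can be illustrated with a short sketch. The function name, the tier labels and the input masses below are hypothetical and serve only to show the arithmetic: purity is taken as the mass of the indicated molecule divided by the mass of all biological macromolecules of the same type, with water, buffers and other small molecules left out of the denominator.

```python
def purity_tier(target_mass, other_macromolecule_mass):
    """Return (weight fraction, tier label) for a preparation.

    target_mass: mass of the indicated polypeptide or nucleotide sequence.
    other_macromolecule_mass: mass of other macromolecules of the same type.
    Water, buffers and small molecules are deliberately not counted.
    """
    total = target_mass + other_macromolecule_mass
    if total <= 0:
        raise ValueError("total macromolecule mass must be positive")
    fraction = target_mass / total
    # Thresholds taken from the definition above, highest first.
    tiers = [
        (0.98, "most preferably"),
        (0.95, "more preferably still"),
        (0.85, "more preferably"),
        (0.75, "preferably"),
    ]
    for threshold, label in tiers:
        if fraction >= threshold:
            return fraction, label
    return fraction, "below the preferred range"


if __name__ == "__main__":
    # Illustrative masses in arbitrary units, not measured data.
    print(purity_tier(96.0, 4.0))
```

The same units must be used for both masses; the result is a ratio and is therefore unit-free.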

3. Expression Systems

Once the appropriate heat shock protein coding sequence is
isolated, it can be expressed in a variety of different
expression systems; for example those used with mammalian
cells, baculoviruses, bacteria, and yeast.

3.1. Mammalian Systems

Mammalian expression systems are known in the art. A
mammalian promoter is any DNA sequence capable of binding
mammalian RNA polymerase and initiating the downstream
(3') transcription of a coding sequence (e.g. structural
gene) into mRNA. A promoter will have a transcription
initiating region, which is usually placed proximal to the
5' end of the coding sequence, and a TATA box, usually
located 25-30 base pairs (bp) upstream of the
transcription initiation site. The TATA box is thought to
direct RNA polymerase II to begin RNA synthesis at the
correct site. A mammalian promoter will also contain an
upstream promoter element, usually located within 100 to
200 bp upstream of the TATA box. An upstream promoter
element determines the rate at which transcription is
initiated and can act in either orientation, Sambrook et
al., Molecular Cloning: A Laboratory Manual, 2nd ed.
(1989).
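
The promoter geometry described above can be sketched as a simple sequence search. This is a minimal illustration only: the motif pattern TATA[AT]A is an assumed consensus, the function name is hypothetical, and the distance is measured, by assumption, from the first base of the motif to the transcription initiation site.

```python
import re


def find_tata_boxes(seq, tss_index, window=(25, 30)):
    """List TATA-like motifs lying within `window` bp upstream of the start site.

    seq: DNA sequence of the coding strand (5' to 3').
    tss_index: 0-based index of the transcription initiation site.
    Returns a list of (motif_start, distance_upstream) tuples.
    """
    seq = seq.upper()
    hits = []
    # A lookahead is used so that overlapping candidate motifs are all reported.
    for match in re.finditer(r"(?=TATA[AT]A)", seq):
        distance = tss_index - match.start()
        if window[0] <= distance <= window[1]:
            hits.append((match.start(), distance))
    return hits


if __name__ == "__main__":
    # Synthetic sequence for illustration; not taken from any real gene.
    demo = "G" * 20 + "TATAAA" + "G" * 24
    print(find_tata_boxes(demo, tss_index=48))
```

A real promoter analysis would also consider the upstream promoter element and enhancers; the sketch covers only the TATA box spacing.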



Mammalian viral genes are often highly expressed and have
a broad host range; therefore sequences encoding mammalian
viral genes provide particularly useful promoter
sequences. Examples include the SV40 early promoter,
mouse mammary tumor virus LTR promoter, adenovirus major
late promoter (Ad MLP), and herpes simplex virus promoter.
In addition, sequences derived from non-viral genes, such
as the murine metallothionein gene, also provide useful
promoter sequences. Expression may be either constitutive
or regulated (inducible); depending on the promoter,
expression can be induced with glucocorticoid in
hormone-responsive cells.

The presence of an enhancer element (enhancer), combined
with the promoter elements described above, will usually
increase expression levels. An enhancer is a regulatory
DNA sequence that can stimulate transcription up to
1000-fold when linked to homologous or heterologous promoters,
with synthesis beginning at the normal RNA start site.
Enhancers are also active when they are placed upstream or
downstream from the transcription initiation site, in
either normal or flipped orientation, or at a distance of
more than 1000 nucleotides from the promoter, Maniatis et
al. (1987) Science 236:1237; Alberts et al. Molecular
Biology of the Cell, 2nd ed. (1989). Enhancer elements
derived from viruses may be particularly useful, because
they usually have a broader host range. Examples include
the SV40 early gene enhancer, Dijkema et al. (1985) EMBO J.
4:761, and the enhancer/promoters derived from the long
terminal repeat (LTR) of the Rous Sarcoma Virus, Gorman et
al. (1982) Proc. Natl. Acad. Sci. 79:6777, and from human
cytomegalovirus, Boshart et al. (1985) Cell 41:521.

Additionally, some enhancers are regulatable and become
active only in the presence of an inducer, such as a 
hormone or metal ion, Sassone-Corsi et al. (1986) Trends 
Genet. 2:215; Maniatis et al. (1987) Science 236:1237. 

A DNA molecule may be expressed intracellularly in
mammalian cells. A promoter sequence may be directly 



WO 93/17712 PCT/EP93/00516 






linked with the DNA molecule, in which case the first 
amino acid at the N-terminus of the recombinant protein 
will always be a methionine, which is encoded by the ATG 
start codon. If desired, the N-terminus may be cleaved 
from the protein by in vitro incubation with cyanogen
bromide. 

Alternatively, foreign proteins can also be secreted from
the cell into the growth media by creating chimeric DNA
molecules that encode a fusion protein comprised of a
leader sequence fragment that provides for secretion of
the foreign protein in mammalian cells. Preferably, there
are processing sites encoded between the leader fragment
and the foreign gene that can be cleaved either in vivo or
in vitro. The leader sequence fragment usually encodes a
signal peptide comprised of hydrophobic amino acids which
direct the secretion of the protein from the cell. The
adenovirus tripartite leader is an example of a leader
sequence that provides for secretion of a foreign protein
in mammalian cells.

Usually, transcription termination and polyadenylation
sequences recognized by mammalian cells are regulatory
regions located 3' to the translation stop codon and thus,
together with the promoter elements, flank the coding
sequence. The 3' terminus of the mature mRNA is formed by
site-specific post-transcriptional cleavage and
polyadenylation, Birnstiel et al. (1985) Cell 41:349; Proudfoot
and Whitelaw (1988) "Termination and 3' end processing of
eukaryotic RNA," in Transcription and splicing (ed. B.D.
Hames and D.M. Glover); Proudfoot (1989) Trends Biochem.
Sci. 14:105. These sequences direct the transcription of
an mRNA which can be translated into the polypeptide
encoded by the DNA. Examples of transcription
terminator/polyadenylation signals include those derived
from SV40, Sambrook et al. (1989), Molecular Cloning: A
Laboratory Manual.

Some genes may be expressed more efficiently when introns
(also called intervening sequences) are present. Several
cDNAs, however, have been efficiently expressed from
vectors that lack splicing signals (also called splice
donor and acceptor sites), see e.g., Gething and Sambrook
(1981) Nature 293:620. Introns are intervening noncoding
sequences within a coding sequence that contain splice
donor and acceptor sites. They are removed by a process
called "splicing," following polyadenylation of the
primary transcript, Nevins (1983) Annu. Rev. Biochem.
52:441; Green (1986) Annu. Rev. Genet. 20:671; Padgett et
al. (1986) Annu. Rev. Biochem. 55:1119; Krainer and
Maniatis (1988) "RNA splicing," in Transcription and
splicing (ed. B.D. Hames and D.M. Glover).

Usually, the above-described components, comprising a
promoter, polyadenylation signal, and transcription
termination sequence are put together into expression
constructs. Enhancers, introns with functional splice
donor and acceptor sites, and leader sequences may also be
included in an expression construct, if desired.
Expression constructs are often maintained in a replicon,
such as an extrachromosomal element (e.g., plasmids)
capable of stable maintenance in a host, such as mammalian
cells or bacteria. Mammalian replication systems include
those derived from animal viruses, which require
trans-acting factors to replicate. For example, plasmids
containing the replication systems of papova viruses, such
as SV40, Gluzman (1981) Cell 23:175, or polyomavirus,
replicate to extremely high copy number in the presence of
the appropriate viral T antigen. Additional examples of
mammalian replicons include those derived from bovine
papillomavirus and Epstein-Barr virus. Additionally, the
replicon may have two replication systems, thus allowing
it to be maintained, for example, in mammalian cells for
expression and in a procaryotic host for cloning and
amplification. Examples of such mammalian-bacteria
shuttle vectors include pMT2, Kaufman et al. (1989) Mol.
Cell. Biol. 9:946, and pHEBO, Shimizu et al. (1986) Mol.
Cell. Biol. 6:1074.

The transformation procedure used depends upon the host to
be transformed. Methods for introduction of heterologous
polynucleotides into mammalian cells are known in the art
and include dextran-mediated transfection, calcium
phosphate precipitation, polybrene mediated transfection,
protoplast fusion, electroporation, encapsulation of the
polynucleotide(s) in liposomes, and direct microinjection
of the DNA into nuclei.

Mammalian cell lines available as hosts for expression are
known in the art and include many immortalized cell lines
available from the American Type Culture Collection
(ATCC), including but not limited to, Chinese hamster
ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK)
cells, monkey kidney cells (COS), human hepatocellular
carcinoma cells (e.g., Hep G2), and a number of other cell
lines.

ii. Baculovirus Systems 

The polynucleotide encoding the protein can also be
inserted into a suitable insect expression vector, and is
operably linked to the control elements within that
vector. Vector construction employs techniques which are
known in the art.

Generally, the components of the expression system include
a transfer vector, usually a bacterial plasmid, which
contains both a fragment of the baculovirus genome, and a
convenient restriction site for insertion of the
heterologous gene or genes to be expressed; a wild type
baculovirus with a sequence homologous to the
baculovirus-specific fragment in the transfer vector (this
allows for the homologous recombination of the heterologous
gene into the baculovirus genome); and appropriate insect
host cells and growth media.

After inserting the DNA sequence encoding the protein into
the transfer vector, the vector and the wild type viral
genome are transfected into an insect host cell where the
vector and viral genome are allowed to recombine. The
packaged recombinant virus is expressed and recombinant
plaques are identified and purified. Materials and
methods for baculovirus/insect cell expression systems are
commercially available in kit form from, inter alia,
Invitrogen, San Diego CA ("MaxBac" kit). These techniques
are generally known to those skilled in the art and fully
described in Summers and Smith, Texas Agricultural
Experiment Station Bulletin No. 1555 (1987) (hereinafter
"Summers and Smith").

Prior to inserting the DNA sequence encoding the protein
into the baculovirus genome, the above-described
components, comprising a promoter, leader (if desired),
coding sequence of interest, and transcription termination
sequence, are usually assembled into an intermediate
transplacement construct (transfer vector). This
construct may contain a single gene and operably linked
regulatory elements; multiple genes, each with its own
set of operably linked regulatory elements; or multiple
genes, regulated by the same set of regulatory elements.
Intermediate transplacement constructs are often
maintained in a replicon, such as an extrachromosomal
element (e.g., plasmids) capable of stable maintenance in
a host, such as a bacterium. The replicon will have a
replication system, thus allowing it to be maintained in a
suitable host for cloning and amplification.

Currently, the most commonly used transfer vector for
introducing foreign genes into AcNPV is pAc373. Many
other vectors, known to those of skill in the art, have
also been designed. These include, for example, pVL985
(which alters the polyhedrin start codon from ATG to ATT,
and which introduces a BamHI cloning site 32 basepairs
downstream from the ATT; see Luckow and Summers, Virology
(1989) 17:31).

The plasmid usually also contains the polyhedrin
polyadenylation signal (Miller et al. (1988) Ann. Rev.
Microbiol., 42:177) and a procaryotic ampicillin-resistance
(amp) gene and origin of replication for
selection and propagation in E. coli.

Baculovirus transfer vectors usually contain a baculovirus
promoter. A baculovirus promoter is any DNA sequence
capable of binding a baculovirus RNA polymerase and
initiating the downstream (5' to 3') transcription of a
coding sequence (e.g. structural gene) into mRNA. A
promoter will have a transcription initiation region which
is usually placed proximal to the 5' end of the coding
sequence. This transcription initiation region usually
includes an RNA polymerase binding site and a
transcription initiation site. A baculovirus transfer
vector may also have a second domain called an enhancer,
which, if present, is usually distal to the structural
gene. Expression may be either regulated or constitutive.

Structural genes, abundantly transcribed at late times in
a viral infection cycle, provide particularly useful
promoter sequences. Examples include sequences derived
from the gene encoding the viral polyhedrin protein,
Friesen et al. (1986) "The Regulation of Baculovirus Gene
Expression," in: The Molecular Biology of Baculoviruses
(ed. Walter Doerfler); EPO Publ. Nos. 127 839 and 155 476;
and the gene encoding the p10 protein, Vlak et al.
(1988) J. Gen. Virol. 69:765.

DNA encoding suitable signal sequences can be derived from
genes for secreted insect or baculovirus proteins, such as
the baculovirus polyhedrin gene (Carbonell et al. (1988)
Gene, 73:409). Alternatively, since the signals for
mammalian cell posttranslational modifications (such as
signal peptide cleavage, proteolytic cleavage, and
phosphorylation) appear to be recognized by insect cells,
and the signals required for secretion and nuclear
accumulation also appear to be conserved between the
invertebrate cells and vertebrate cells, leaders of
non-insect origin, such as those derived from genes encoding
human α-interferon, Maeda et al. (1985) Nature 315:592;
human gastrin-releasing peptide, Lebacq-Verheyden et al.
(1988) Molec. Cell. Biol. 8:3129; human IL-2, Smith et
al. (1985) Proc. Nat'l Acad. Sci. USA 82:8404; mouse
IL-3, Miyajima et al. (1987) Gene 58:273; and human
glucocerebrosidase, Martin et al. (1988) DNA 7:99, can
also be used to provide for secretion in insects.

A recombinant polypeptide or polyprotein may be expressed
intracellularly or, if it is expressed with the proper
regulatory sequences, it can be secreted. Good
intracellular expression of nonfused foreign proteins
usually requires heterologous genes that ideally have a
short leader sequence containing suitable translation
initiation signals preceding an ATG start signal. If
desired, methionine at the N-terminus may be cleaved from
the mature protein by in vitro incubation with cyanogen
bromide.

Alternatively, recombinant polyproteins or proteins which
are not naturally secreted can be secreted from the insect
cell by creating chimeric DNA molecules that encode a
fusion protein comprised of a leader sequence fragment
that provides for secretion of the foreign protein in
insects. The leader sequence fragment usually encodes a
signal peptide comprised of hydrophobic amino acids which
direct the translocation of the protein into the
endoplasmic reticulum.

After insertion of the DNA sequence and/or the gene
encoding the expression product precursor of the protein,
an insect cell host is co-transformed with the
heterologous DNA of the transfer vector and the genomic
DNA of wild type baculovirus, usually by
co-transfection. The promoter and transcription termination
sequence of the construct will usually comprise a 2-5kb
section of the baculovirus genome. Methods for
introducing heterologous DNA into the desired site in the
baculovirus are known in the art. (See Summers and
Smith; Ju et al. (1987); Smith et al., Mol. Cell. Biol.
(1983) 3:2156; and Luckow and Summers (1989)). For
example, the insertion can be into a gene such as the
polyhedrin gene, by homologous double crossover
recombination; insertion can also be into a restriction
enzyme site engineered into the desired baculovirus gene.
Miller et al., (1989), Bioessays 4:91.

The DNA sequence, when cloned in place of the polyhedrin
gene in the expression vector, is flanked both 5' and 3'
by polyhedrin-specific sequences and is positioned
downstream of the polyhedrin promoter.

The newly formed baculovirus expression vector is
subsequently packaged into an infectious recombinant
baculovirus. Homologous recombination occurs at low
frequency (between about 1% and about 5%); thus, the
majority of the virus produced after cotransfection is
still wild-type virus. Therefore, a method is necessary
to identify recombinant viruses. An advantage of the
expression system is a visual screen allowing recombinant
viruses to be distinguished. The polyhedrin protein,
which is produced by the native virus, is produced at very
high levels in the nuclei of infected cells at late times
after viral infection. Accumulated polyhedrin protein
forms occlusion bodies that also contain embedded
particles. These occlusion bodies, up to 15 µm in size,
are highly refractile, giving them a bright shiny
appearance that is readily visualized under the light
microscope. Cells infected with recombinant viruses lack
occlusion bodies. To distinguish recombinant virus from
wild-type virus, the transfection supernatant is plaqued
onto a monolayer of insect cells by techniques known to
those skilled in the art. Namely, the plaques are
screened under the light microscope for the presence
(indicative of wild-type virus) or absence (indicative of
recombinant virus) of occlusion bodies. "Current
Protocols in Microbiology" Vol. 2 (Ausubel et al. eds) at
16.8 (Supp. 10, 1990); Summers and Smith; Miller et al.
(1989).
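
Because recombinants make up only about 1% to 5% of the progeny virus, the screening effort implied by the passage above can be estimated with a short calculation. The sketch below assumes, purely for illustration, that each plaque is independently recombinant with a fixed probability; the function name and the 95% confidence target are hypothetical choices, not values from this specification.

```python
import math


def plaques_to_screen(recombinant_fraction, confidence=0.95):
    """Smallest number of plaques giving at least `confidence` probability
    of seeing one or more occlusion-negative (recombinant) plaques.

    Solves 1 - (1 - f)**n >= confidence for the integer n.
    """
    if not 0 < recombinant_fraction < 1:
        raise ValueError("recombinant_fraction must lie strictly between 0 and 1")
    if not 0 < confidence < 1:
        raise ValueError("confidence must lie strictly between 0 and 1")
    return math.ceil(math.log(1 - confidence) / math.log(1 - recombinant_fraction))


if __name__ == "__main__":
    # The two ends of the frequency range stated in the text.
    for fraction in (0.01, 0.05):
        print(fraction, plaques_to_screen(fraction))
```

Under these assumptions, a few hundred plaques are needed at the low end of the range and a few dozen at the high end, which is consistent with the need for a quick visual screen.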


Recombinant baculovirus expression vectors have been
developed for infection into several insect cells. For
example, recombinant baculoviruses have been developed
for, inter alia: Aedes aegypti, Autographa californica,
Bombyx mori, Drosophila melanogaster, Spodoptera
frugiperda, and Trichoplusia ni (PCT Pub. No. WO
89/046699; Carbonell et al. (1985) J. Virol. 56:153;
Wright (1986) Nature 321:718; Smith et al. (1983) Mol.
Cell. Biol. 3:2156; and see generally, Fraser, et al.
(1989) In Vitro Cell. Dev. Biol. 25:225).

Cells and cell culture media are commercially available
for both direct and fusion expression of heterologous
polypeptides in a baculovirus/expression system; cell
culture technology is generally known to those skilled in
the art. See, e.g., Summers and Smith.

The modified insect cells may then be grown in an
appropriate nutrient medium, which allows for stable
maintenance of the plasmid(s) present in the modified
insect host. Where the expression product gene is under
inducible control, the host may be grown to high density,
and expression induced. Alternatively, where expression
is constitutive, the product will be continuously
expressed into the medium and the nutrient medium must be
continuously circulated, while removing the product of
interest and augmenting depleted nutrients. The product
may be purified by such techniques as chromatography,
e.g., HPLC, affinity chromatography, ion exchange
chromatography, etc.; electrophoresis; density gradient
centrifugation; solvent extraction, or the like. As
appropriate, the product may be further purified, as
required, so as to remove substantially any insect
proteins which are also secreted in the medium or result
from lysis of insect cells, so as to provide a product
which is at least substantially free of host debris, e.g.,
proteins, lipids and polysaccharides.

In order to obtain protein expression, recombinant host
cells derived from the transformants are incubated under
conditions which allow expression of the recombinant
protein encoding sequence. These conditions will vary,
dependent upon the host cell selected. However, the
conditions are readily ascertainable to those of ordinary
skill in the art, based upon what is known in the art.






iii. Bacterial Systems 






Bacterial expression techniques are known in the art. A
bacterial promoter is any DNA sequence capable of binding
bacterial RNA polymerase and initiating the downstream
(3') transcription of a coding sequence (e.g. structural
gene) into mRNA. A promoter will have a transcription
initiation region which is usually placed proximal to the
5' end of the coding sequence. This transcription
initiation region usually includes an RNA polymerase
binding site and a transcription initiation site. A
bacterial promoter may also have a second domain called an
operator, that may overlap an adjacent RNA polymerase
binding site at which RNA synthesis begins. The operator
permits negatively regulated (inducible) transcription, as a
gene repressor protein may bind the operator and thereby
inhibit transcription of a specific gene. Constitutive
expression may occur in the absence of negative regulatory
elements, such as the operator. In addition, positive
regulation may be achieved by a gene activator protein
binding sequence, which, if present, is usually proximal
(5') to the RNA polymerase binding sequence. An example
of a gene activator protein is the catabolite activator
protein (CAP), which helps initiate transcription of the
lac operon in E. coli, Raibaud et al. (1984) Annu. Rev.
Genet. 18:173. Regulated expression may therefore be
either positive or negative, thereby either enhancing or
reducing transcription.

Sequences encoding metabolic pathway enzymes provide
particularly useful promoter sequences. Examples include
promoter sequences derived from sugar metabolizing
enzymes, such as galactose, lactose (lac), Chang et al.
(1977) Nature 198:1056, and maltose. Additional examples
include promoter sequences derived from biosynthetic
enzymes such as tryptophan (trp), Goeddel et al. (1980)
Nucl. Acids Res. 8:4057; Yelverton et al. (1981) Nucl.
Acids Res. 9:731; U.S. 4,738,921; EPO Publ. Nos. 036 776
and 121 775. The β-lactamase (bla) promoter system,
Weissmann (1981) "The cloning of interferon and other
mistakes," in Interferon 3 (ed. I. Gresser),
bacteriophage lambda PL, Shimatake et al. (1981) Nature
292:128, and T5, U.S. 4,689,406, promoter systems also
provide useful promoter sequences.

In addition, synthetic promoters which do not occur in
nature also function as bacterial promoters. For example,
transcription activation sequences of one bacterial or
bacteriophage promoter may be joined with the operon
sequences of another bacterial or bacteriophage promoter,
creating a synthetic hybrid promoter, U.S. 4,551,433. For
example, the tac promoter is a hybrid trp-lac promoter
comprised of both trp promoter and lac operon sequences
that is regulated by the lac repressor, Amann et al.
(1983) Gene 25:167; de Boer et al. (1983) Proc. Natl.
Acad. Sci. 80:21. Furthermore, a bacterial promoter can
include naturally occurring promoters of non-bacterial
origin that have the ability to bind bacterial RNA
polymerase and initiate transcription. A naturally
occurring promoter of non-bacterial origin can also be
coupled with a compatible RNA polymerase to produce high
levels of expression of some genes in prokaryotes. The
bacteriophage T7 RNA polymerase/promoter system is an
example of a coupled promoter system, Studier et al.
(1986) J. Mol. Biol. 189:113; Tabor et al. (1985) Proc.
Natl. Acad. Sci. 82:1074. In addition, a hybrid promoter
can also be comprised of a bacteriophage promoter and an
E. coli operator region (EPO Publ. No. 267 851).

In addition to a functioning promoter sequence, an
efficient ribosome binding site is also useful for the
expression of foreign genes in prokaryotes. In E. coli,
the ribosome binding site is called the Shine-Dalgarno
(SD) sequence and includes an initiation codon (ATG) and a
sequence 3-9 nucleotides in length located 3-11
nucleotides upstream of the initiation codon, Shine et al.
(1975) Nature 254:34. The SD sequence is thought to
promote binding of mRNA to the ribosome by the pairing of
bases between the SD sequence and the 3' end of E. coli
16S rRNA, Steitz et al. (1979) "Genetic signals and
nucleotide sequences in messenger RNA," in Biological
Regulation and Development: Gene Expression (ed. R.F.
Goldberger). For the expression of eukaryotic genes and
prokaryotic genes with a weak ribosome-binding site, see
Sambrook et al. (1989), Molecular Cloning: A Laboratory
Manual.

A DNA molecule may be expressed intracellularly. A
promoter sequence may be directly linked with the DNA
molecule, in which case the first amino acid at the
N-terminus will always be a methionine, which is encoded by
the ATG start codon. If desired, methionine at the
N-terminus may be cleaved from the protein by in vitro
incubation with cyanogen bromide or by either in vivo or
in vitro incubation with a bacterial methionine N-terminal
peptidase (EPO Publ. No. 219 237).

Fusion proteins provide an alternative to direct
expression. Usually, a DNA sequence encoding the
N-terminal portion of an endogenous bacterial protein, or
other stable protein, is fused to the 5' end of
heterologous coding sequences. Upon expression, this
construct will provide a fusion of the two amino acid
sequences. For example, the bacteriophage lambda cell
gene can be linked at the 5' terminus of a foreign gene
and expressed in bacteria. The resulting fusion protein
preferably retains a site for a processing enzyme (factor
Xa) to cleave the bacteriophage protein from the foreign
gene, Nagai et al. (1984) Nature 309:810. Fusion proteins
can also be made with sequences from the lacZ, Jia et al.
(1987) Gene 60:197, trpE, Allen et al. (1987) J.
Biotechnol. 5:93; Makoff et al. (1989) J. Gen. Microbiol.
135:11, and EPO Publ. No. 324 647, genes. The DNA
sequence at the junction of the two amino acid sequences
may or may not encode a cleavable site. Another example
is a ubiquitin fusion protein. Such a fusion protein is
made with the ubiquitin region that preferably retains a
site for a processing enzyme (e.g. ubiquitin specific
processing-protease) to cleave the ubiquitin from the
foreign protein. Through this method, native foreign
protein can be isolated. Miller et al. (1989)
Bio/Technology 7:698.

Alternatively, foreign proteins can also be secreted from
the cell by creating chimeric DNA molecules that encode a
fusion protein comprised of a signal peptide sequence
fragment that provides for secretion of the foreign
protein in bacteria, U.S. 4,336,336. The signal sequence
fragment usually encodes a signal peptide comprised of
hydrophobic amino acids which direct the secretion of the
protein from the cell. The protein is either secreted
into the growth media (gram-positive bacteria) or into the
periplasmic space, located between the inner and outer
membrane of the cell (gram-negative bacteria). Preferably
there are processing sites, which can be cleaved either in
vivo or in vitro, encoded between the signal peptide
fragment and the foreign gene.
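
The statement that a signal peptide is composed largely of hydrophobic amino acids can be illustrated with a crude composition check. The residue set, the 20-residue window and the function name are assumptions chosen for this sketch; established signal peptide prediction methods are considerably more elaborate.

```python
# Assumed set of hydrophobic residues (one-letter codes); other
# classifications include or exclude a few borderline residues.
HYDROPHOBIC = set("AVLIMFWC")


def hydrophobic_fraction(protein, window=20):
    """Fraction of hydrophobic residues in the first `window` residues."""
    region = protein.upper()[:window]
    if not region:
        raise ValueError("protein sequence must not be empty")
    return sum(residue in HYDROPHOBIC for residue in region) / len(region)


if __name__ == "__main__":
    # Illustrative N-terminal sequence; treat it as a made-up example.
    print(hydrophobic_fraction("MKKTAIAIAVALAGFATVAQA"))
```

A high fraction is only a rough hint that an N-terminal region could act as a leader; it does not establish that the region is cleaved or that it directs secretion.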

DNA encoding suitable signal sequences can be derived from
genes for secreted bacterial proteins, such as the E. coli
outer membrane protein gene (ompA), Masui et al. (1983),
in: Experimental Manipulation of Gene Expression; Ghrayeb
et al. (1984) EMBO J. 3:2437, and the E. coli alkaline
phosphatase signal sequence (phoA), Oka et al. (1985)
Proc. Natl. Acad. Sci. 82:7212. As an additional example,
the signal sequence of the alpha-amylase gene from various
Bacillus strains can be used to secrete heterologous
proteins from B. subtilis, Palva et al. (1982) Proc. Natl.
Acad. Sci. USA 79:5582; EPO Publ. No. 244 042.

Usually, transcription termination sequences recognized by
bacteria are regulatory regions located 3' to the
translation stop codon, and thus together with the
promoter flank the coding sequence. These sequences
direct the transcription of an mRNA which can be
translated into the polypeptide encoded by the DNA.
Transcription termination sequences frequently include DNA
sequences of about 50 nucleotides capable of forming stem
loop structures that aid in terminating transcription.
Examples include transcription termination sequences
derived from genes with strong promoters, such as the trp
gene in E. coli as well as other biosynthetic genes.

Usually, the above-described components, comprising a
promoter, signal sequence (if desired), coding sequence of
interest, and transcription termination sequence, are put
together into expression constructs. Expression
constructs are often maintained in a replicon, such as an
extrachromosomal element (e.g., plasmids) capable of
stable maintenance in a host, such as bacteria. The
replicon will have a replication system, thus allowing it
to be maintained in a procaryotic host either for
expression or for cloning and amplification. In addition,
a replicon may be either a high or low copy number
plasmid. A high copy number plasmid will generally have a
copy number ranging from about 5 to about 200, and usually
about 10 to about 150. A host containing a high copy
number plasmid will preferably contain at least about 10,
and more preferably at least about 20 plasmids. Either a
high or low copy number vector may be selected, depending
upon the effect of the vector and the foreign protein on
the host.

Alternatively, the expression constructs can be integrated
into the bacterial genome with an integrating vector.
Integrating vectors usually contain at least one sequence
homologous to the bacterial chromosome that allows the
vector to integrate. Integrations appear to result from
recombinations between homologous DNA in the vector and
the bacterial chromosome. For example, integrating
vectors constructed with DNA from various Bacillus strains
integrate into the Bacillus chromosome (EPO Publ. No. 127
328). Integrating vectors may also be comprised of
bacteriophage or transposon sequences.

Usually, extrachromosomal and integrating expression
constructs may contain selectable markers to allow for the
selection of bacterial strains that have been transformed.
Selectable markers can be expressed in the bacterial host
and may include genes which render bacteria resistant to
drugs such as ampicillin, chloramphenicol, erythromycin,
kanamycin (neomycin), and tetracycline, Davies et al.
(1978) Annu. Rev. Microbiol. 32:469. Selectable markers
may also include biosynthetic genes, such as those in the
histidine, tryptophan, and leucine biosynthetic pathways.

Alternatively, some of the above-described components can
be put together in transformation vectors. Transformation
vectors are usually comprised of a selectable marker that
is either maintained in a replicon or developed into an
integrating vector.

Expression and transformation vectors, either
extrachromosomal replicons or integrating vectors, have been
developed for transformation into many bacteria. For
example, expression vectors have been developed for, inter
alia, the following bacteria: Bacillus subtilis, Palva et
al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ.
Nos. 036 259 and 063 953; PCT Publ. No. WO 84/04541; E.
coli, Shimatake et al. (1981) Nature 292:128; Amann et al.
(1985) Gene 40:183; Studier et al. (1986) J. Mol. Biol.
189:113; EPO Publ. Nos. 036 776, 136 829 and 136 907;
Streptococcus cremoris, Powell et al. (1988) Appl.
Environ. Microbiol. 54:655; Streptococcus lividans, Powell
et al. (1988) Appl. Environ. Microbiol. 54:655; and
Streptomyces lividans, U.S. 4,745,056.

Methods of introducing exogenous DNA into bacterial hosts
are well-known in the art, and usually include either the
transformation of bacteria treated with CaCl2 or other
agents, such as divalent cations and DMSO. DNA can also
be introduced into bacterial cells by electroporation.

Transformation procedures usually vary with the bacterial
species to be transformed. See, e.g., Masson et al.
(1989) FEMS Microbiol. Lett. 60:273; Palva et al. (1982)
Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. Nos. 036 259
and 063 953; PCT Publ. No. WO 84/04541, for Bacillus;
Miller et al. (1988) Proc. Natl. Acad. Sci. 85:856; Wang
et al. (1990) J. Bacteriol. 172:949, for Campylobacter;
Cohen et al. (1973) Proc. Natl. Acad. Sci. 69:2110; Dower
et al. (1988) Nucleic Acids Res. 16:6127; Kushner (1978)
"An improved method for transformation of E. coli with
ColE1-derived plasmids," in: Genetic Engineering:
Proceedings of the International Symposium on Genetic
Engineering (eds. H.W. Boyer and S. Nicosia); Mandel et
al. (1970) J. Mol. Biol. 53:159; Taketo (1988) Biochim.
Biophys. Acta 949:318, for Escherichia; Chassy et al.
(1987) FEMS Microbiol. Lett. 44:173, for Lactobacillus;
Fiedler et al. (1988) Anal. Biochem. 170:38, for
Pseudomonas; Augustin et al. (1990) FEMS Microbiol. Lett.
66:203, for Staphylococcus; Barany et al. (1980) J.
Bacteriol. 144:698; Harlander (1987) "Transformation of
Streptococcus lactis by electroporation," in: Streptococcal
Genetics (ed. J. Ferretti and R. Curtiss III); Perry et
al. (1981) Infec. Immun. 32:1295; Powell et al. (1988)
Appl. Environ. Microbiol. 54:655; Somkuti et al. (1987)
Proc. 4th Eur. Cong. Biotechnology 1:412, for
Streptococcus.

iv. Yeast Expression

Yeast expression systems are also known to one of ordinary
skill in the art. A yeast promoter is any DNA sequence
capable of binding yeast RNA polymerase and initiating the
downstream (3') transcription of a coding sequence (e.g.
structural gene) into mRNA. A promoter will have a
transcription initiation region which is usually placed
proximal to the 5' end of the coding sequence. This
transcription initiation region usually includes an RNA
polymerase binding site (the "TATA Box") and a
transcription initiation site. A yeast promoter may also
have a second domain called an upstream activator sequence
(UAS), which, if present, is usually distal to the
structural gene. The UAS permits regulated (inducible)
expression. Constitutive expression occurs in the absence
of a UAS. Regulated expression may be either positive or
negative, thereby either enhancing or reducing
transcription.

Yeast is a fermenting organism with an active metabolic
pathway; therefore, sequences encoding enzymes in the
metabolic pathway provide particularly useful promoter
sequences. Examples include alcohol dehydrogenase (ADH)
(EPO Publ. No. 284 044), enolase, glucokinase,
glucose-6-phosphate isomerase,
glyceraldehyde-3-phosphate-dehydrogenase (GAP or GAPDH),
hexokinase, phosphofructokinase, 3-phosphoglycerate mutase,
and pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast
PHO5 gene, encoding acid phosphatase, also provides useful
promoter sequences, Miyanohara et al. (1983) Proc. Natl.
Acad. Sci. USA 80:1.



In addition, synthetic promoters which do not occur in
nature also function as yeast promoters. For example, UAS
sequences of one yeast promoter may be joined with the
transcription activation region of another yeast promoter,
creating a synthetic hybrid promoter. Examples of such
hybrid promoters include the ADH regulatory sequence
linked to the GAP transcription activation region (U.S.
4,876,197 and U.S. 4,880,734). Other examples of hybrid
promoters include promoters which consist of the
regulatory sequences of either the ADH2, GAL4, GAL10, or
PHO5 genes, combined with the transcriptional activation
region of a glycolytic enzyme gene such as GAP or PyK (EPO
Publ. No. 164 556). Furthermore, a yeast promoter can
include naturally occurring promoters of non-yeast origin
that have the ability to bind yeast RNA polymerase and
initiate transcription. Examples of such promoters
include, inter alia, Cohen et al. (1980) Proc. Natl. Acad.
Sci. USA 77:1078; Henikoff et al. (1981) Nature 283:835;
Hollenberg et al. (1981) Curr. Topics Microbiol. Immunol.
96:119; Hollenberg et al. (1979) "The Expression of
Bacterial Antibiotic Resistance Genes in the Yeast
Saccharomyces cerevisiae," in: Plasmids of Medical,
Environmental and Commercial Importance (eds. K.N. Timmis
and A. Puhler); Mercereau-Puigalon et al. (1980) Gene
11:163; Panthier et al. (1980) Curr. Genet. 2:109.

A DNA molecule may be expressed intracellularly in yeast.
A promoter sequence may be directly linked with the DNA
molecule, in which case the first amino acid at the
N-terminus of the recombinant protein will always be a
methionine, which is encoded by the ATG start codon. If
desired, methionine at the N-terminus may be cleaved from
the protein by in vitro incubation with cyanogen bromide.

Fusion proteins provide an alternative for yeast
expression systems, as well as in mammalian, baculovirus,
and bacterial expression systems. Usually, a DNA sequence
encoding the N-terminal portion of an endogenous yeast
protein, or other stable protein, is fused to the 5' end
of heterologous coding sequences. Upon expression, this
construct will provide a fusion of the two amino acid
sequences. For example, the yeast or human superoxide
dismutase (SOD) gene can be linked at the 5' terminus of
a foreign gene and expressed in yeast. The DNA sequence
at the junction of the two amino acid sequences may or may
not encode a cleavable site. See e.g., EPO Publ. No. 196
056. Another example is a ubiquitin fusion protein. Such
a fusion protein is made with the ubiquitin region that
preferably retains a site for a processing enzyme (e.g.
ubiquitin-specific processing protease) to cleave the
ubiquitin from the foreign protein. Through this method,
therefore, native foreign protein can be isolated (see,
e.g., PCT Publ. No. WO 88/024066).

Alternatively, foreign proteins can also be secreted from
the cell into the growth media by creating chimeric DNA
molecules that encode a fusion protein comprised of a
leader sequence fragment that provides for secretion in
yeast of the foreign protein. Preferably, there are
processing sites encoded between the leader fragment and
the foreign gene that can be cleaved either in vivo or in
vitro. The leader sequence fragment usually encodes a
signal peptide comprised of hydrophobic amino acids which
direct the secretion of the protein from the cell.

DNA encoding suitable signal sequences can be derived from
genes for secreted yeast proteins, such as the yeast
invertase gene (EPO Publ. No. 012 873; JPO Publ. No.
62,096,086) and the A-factor gene (U.S. 4,588,684).
Alternatively, leaders of non-yeast origin, such as an
interferon leader, exist that also provide for secretion
in yeast (EPO Publ. No. 060 057).

A preferred class of secretion leaders are those that
employ a fragment of the yeast alpha-factor gene, which
contains both a "pre" signal sequence, and a "pro" region.
The types of alpha-factor fragments that can be employed
include the full-length pre-pro alpha factor leader (about
83 amino acid residues) as well as truncated alpha-factor
leaders (usually about 25 to about 50 amino acid residues)
(U.S. 4,546,083 and U.S. 4,870,008; EPO Publ. No. 324
274). Additional leaders employing an alpha-factor leader
fragment that provides for secretion include hybrid
alpha-factor leaders made with a presequence of a first
yeast, but a pro-region from a second yeast alpha factor.
(See e.g., PCT Publ. No. WO 89/02463.)

Usually, transcription termination sequences recognized by
yeast are regulatory regions located 3' to the translation
stop codon, and thus together with the promoter flank the
coding sequence. These sequences direct the transcription
of an mRNA which can be translated into the polypeptide
encoded by the DNA. Examples include transcription
terminator sequences and other yeast-recognized
termination sequences, such as those of genes coding for
glycolytic enzymes.

Usually, the above-described components, comprising a
promoter, leader (if desired), coding sequence of
interest, and transcription termination sequence, are put
together into expression constructs. Expression
constructs are often maintained in a replicon, such as an
extrachromosomal element (e.g., plasmids) capable of
stable maintenance in a host, such as yeast or bacteria.
The replicon may have two replication systems, thus
allowing it to be maintained, for example, in yeast for
expression and in a procaryotic host for cloning and
amplification. Examples of such yeast-bacteria shuttle
vectors include YEp24, Botstein et al. (1979) Gene
8:17-24; pCl/1, Brake et al. (1984) Proc. Natl. Acad. Sci.
USA 81:4642-4646; and YRp17, Stinchcomb et al. (1982) J.
Mol. Biol. 158:157. In addition, a replicon may be either a
high or low copy number plasmid. A high copy number
plasmid will generally have a copy number ranging from
about 5 to about 200, and usually about 10 to about 150.
A host containing a high copy number plasmid will
preferably have at least about 10, and more preferably at
least about 20. A high or low copy number vector may be
selected, depending upon the effect of the vector and the
foreign protein on the host.
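
The order of the components named above (promoter, optional
leader, coding sequence of interest, transcription
termination sequence) can be pictured with a minimal sketch.
The class name, field names and part names below are
hypothetical and serve only as an illustration; they are not
taken from the specification or from any cited vector.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class ExpressionConstruct:
    """Parts of an expression construct, listed 5' to 3' (illustrative only)."""
    promoter: str
    coding_sequence: str
    terminator: str
    leader: Optional[str] = None  # secretion leader, included only if desired

    def ordered_parts(self) -> List[Tuple[str, str]]:
        """Return (role, name) pairs in 5'-to-3' order, skipping an absent leader."""
        parts = [
            ("promoter", self.promoter),
            ("leader", self.leader),
            ("coding sequence", self.coding_sequence),
            ("terminator", self.terminator),
        ]
        return [(role, name) for role, name in parts if name is not None]


# Hypothetical part names, chosen only to make the example readable.
secreted = ExpressionConstruct(
    promoter="ADH promoter",
    leader="alpha-factor leader",
    coding_sequence="gene of interest",
    terminator="glycolytic-gene terminator",
)
intracellular = ExpressionConstruct(
    promoter="GAP promoter",
    coding_sequence="gene of interest",
    terminator="glycolytic-gene terminator",
)

for role, name in secreted.ordered_parts():
    print(f"{role}: {name}")
```

Leaving `leader` unset models intracellular expression, in
which the promoter is linked directly to the coding sequence.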

Alternatively, the expression constructs can be integrated
into the yeast genome with an integrating vector.
Integrating vectors usually contain at least one sequence
homologous to a yeast chromosome that allows the vector to
integrate, and preferably contain two homologous sequences
flanking the expression construct. Integrations appear to
result from recombinations between homologous DNA in the
vector and the yeast chromosome, Orr-Weaver et al. (1983)
Methods in Enzymol. 101:228-245. An integrating vector
may be directed to a specific locus in yeast by selecting
the appropriate homologous sequence for inclusion in the
vector. One or more expression constructs may integrate,
possibly affecting levels of recombinant protein produced,
Rine et al. (1983) Proc. Natl. Acad. Sci. USA 80:6750.
The chromosomal sequences included in the vector can occur
either as a single segment in the vector, which results in
the integration of the entire vector, or two segments
homologous to adjacent segments in the chromosome and
flanking the expression construct in the vector, which can
result in the stable integration of only the expression
construct.

Usually, extrachromosomal and integrating expression
constructs may contain selectable markers to allow for the
selection of yeast strains that have been transformed.
Selectable markers may include biosynthetic genes that can
be expressed in the yeast host, such as ADE2, HIS4, LEU2,
TRP1, and ALG7, and the G418 resistance gene, which confer
resistance in yeast cells to tunicamycin and G418,
respectively. In addition, a suitable selectable marker
may also provide yeast with the ability to grow in the
presence of toxic compounds, such as metal. For example,
the presence of CUP1 allows yeast to grow in the presence
of copper ions, Butt et al. (1987) Microbiol. Rev. 51:351.

Alternatively, some of the above-described components can
be put together into transformation vectors.
Transformation vectors are usually comprised of a
selectable marker that is either maintained in a replicon
or developed into an integrating vector.

Expression and transformation vectors, either
extrachromosomal replicons or integrating vectors, have
been developed for transformation into many yeasts. For
example, expression vectors have been developed for, inter
alia, the following yeasts: Candida albicans, Kurtz et
al. (1986) Mol. Cell. Biol. 6:142; Candida maltosa, Kunze
et al. (1985) J. Basic Microbiol. 25:141; Hansenula
polymorpha, Gleeson et al. (1986) J. Gen. Microbiol.
132:3459; Roggenkamp et al. (1986) Mol. Gen. Genet.
202:302; Kluyveromyces fragilis, Das et al. (1984) J.
Bacteriol. 158:1165; Kluyveromyces lactis, De Louvencourt
et al. (1983) J. Bacteriol. 154:737; Van den Berg et al.
(1990) Bio/Technology 8:135; Pichia guilliermondii, Kunze
et al. (1985) J. Basic Microbiol. 25:141; Pichia pastoris,
Cregg et al. (1985) Mol. Cell. Biol. 5:3376; U.S.
4,837,148 and U.S. 4,929,555; Saccharomyces cerevisiae,
Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 75:1929;
Ito et al. (1983) J. Bacteriol. 153:163;
Schizosaccharomyces pombe, Beach et al. (1981) Nature
300:706; and Yarrowia lipolytica, Davidow et al. (1985)
Curr. Genet. 10:39; Gaillardin et al. (1985) Curr.
Genet. 10:49.

Methods of introducing exogenous DNA into yeast hosts are
well-known in the art, and usually include either the
transformation of spheroplasts or of intact yeast cells
treated with alkali cations. Transformation procedures
usually vary with the yeast species to be transformed.
See e.g., Kurtz et al. (1986) Mol. Cell. Biol. 6:142;
Kunze et al. (1985) J. Basic Microbiol. 25:141, for
Candida; Gleeson et al. (1986) J. Gen. Microbiol.
132:3459; Roggenkamp et al. (1986) Mol. Gen. Genet.
202:302, for Hansenula; Das et al. (1984) J. Bacteriol.
158:1165; De Louvencourt et al. (1983) J. Bacteriol.
154:737; Van den Berg et al. (1990) Bio/Technology 8:135,
for Kluyveromyces; Cregg et al. (1985) Mol. Cell. Biol.
5:3376; Kunze et al. (1985) J. Basic Microbiol. 25:141;
U.S. 4,837,148 and U.S. 4,929,555, for Pichia; Hinnen et
al. (1978) Proc. Natl. Acad. Sci. USA 75:1929; Ito et al.
(1983) J. Bacteriol. 153:163, for Saccharomyces; Beach et
al. (1981) Nature 300:706, for Schizosaccharomyces;
Davidow et al. (1985) Curr. Genet. 10:39; Gaillardin et
al. (1985) Curr. Genet. 10:49, for Yarrowia.

4. Vaccines

Each of the conjugate compounds discussed herein may be
used as a sole vaccine candidate or in combination with
one or more other antigens from other pathogenic sources.
These vaccines may either be prophylactic (to prevent
infection) or therapeutic (to treat disease after
infection).

Such vaccines comprise the conjugate compound usually in
combination with "pharmaceutically acceptable carriers",
which include any carrier that does not itself induce the
production of antibodies harmful to the individual
receiving the composition. Suitable carriers are
typically large, slowly metabolized macromolecules such as
proteins, polysaccharides, polylactic acids, polyglycolic
acids, polymeric amino acids, amino acid copolymers, lipid
aggregates (such as oil droplets or liposomes), and
inactive virus particles. Such carriers are well known to
those of ordinary skill in the art. Additionally, these
carriers may function as immunostimulating agents
("adjuvants"). Furthermore, the antigen may be conjugated
to a bacterial toxoid, such as a toxoid from diphtheria,
tetanus etc.

Preferred adjuvants to enhance effectiveness of the
composition include, but are not limited to: (1) aluminum
salts (alum), such as aluminum hydroxide, aluminum
phosphate, aluminum sulfate, etc; (2) oil-in-water
emulsion formulations (with or without other specific
immunostimulating agents such as muramyl peptides (see
below) or bacterial cell wall components), such as for
example (a) MF59 (PCT Publ. No. WO 90/14837), containing
5% Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally
containing various amounts of MTP-PE (see below), although
not required) formulated into submicron particles using a
microfluidizer such as Model 110Y microfluidizer
(Microfluidics, Newton, MA), (b) SAF, containing 10%
Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121,
and thr-MDP (see below) either microfluidized into a
submicron emulsion or vortexed to generate a larger
particle size emulsion, and (c) Ribi™ adjuvant system
(RAS), (Ribi Immunochem, Hamilton, MT) containing 2%
Squalene, 0.2% Tween 80, and one or more bacterial cell
wall components from the group consisting of
monophosphorylipid A (MPL), trehalose dimycolate (TDM),
and cell wall skeleton (CWS), preferably MPL + CWS
(Detox™); (3) saponin adjuvants, such as Stimulon™
(Cambridge Bioscience, Worcester, MA) may be used or
particles generated therefrom such as ISCOMs
(immunostimulating complexes); (4) Complete Freunds
Adjuvant (CFA) and Incomplete Freunds Adjuvant (IFA); (5)
cytokines, such as interleukins (IL-1, IL-2, etc.),
macrophage colony stimulating factor (M-CSF), tumor
necrosis factor (TNF), etc; and (6) other substances that
act as immunostimulating agents to enhance the
effectiveness of the composition. Alum and MF59 are
preferred.

As mentioned above, muramyl peptides include, but are not
limited to, N-acetyl-muramyl-L-threonyl-D-isoglutamine
(thr-MDP), N-acetyl-normuramyl-L-alanyl-D-isoglutamine
(nor-MDP),
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine
(MTP-PE), etc.

The immunogenic compositions (e.g., the antigen,
pharmaceutically acceptable carrier, and adjuvant)
typically will contain diluents, such as water, saline,
glycerol, ethanol, etc. Additionally, auxiliary
substances, such as wetting or emulsifying agents, pH
buffering substances, and the like, may be present in such
vehicles.

Typically, the immunogenic compositions are prepared as
injectables, either as liquid solutions or suspensions;
solid forms suitable for solution in, or suspension in,
liquid vehicles prior to injection may also be prepared.
The preparation also may be emulsified or encapsulated in
liposomes for enhanced adjuvant effect, as discussed above
under pharmaceutically acceptable carriers.

Immunogenic compositions used as vaccines comprise an
immunologically effective amount of the antigenic
polypeptides, as well as any other of the above-mentioned
components, as needed. By "immunologically effective
amount", it is meant that the administration of that
amount to an individual, either in a single dose or as
part of a series, is effective for treatment or
prevention. This amount varies depending upon the health
and physical condition of the individual to be treated,
the taxonomic group of individual to be treated (e.g.,
nonhuman primate, primate, etc.), the capacity of the
individual's immune system to synthesize antibodies, the
degree of protection desired, the formulation of the
vaccine, the treating doctor's assessment of the medical
situation, and other relevant factors. It is expected
that the amount will fall in a relatively broad range that
can be determined through routine trials.

The immunogenic compositions are conventionally
administered parenterally, e.g., by injection, either
subcutaneously or intramuscularly. Additional formulations
suitable for other modes of administration include oral
and pulmonary formulations, suppositories, and transdermal
applications. Dosage treatment may be a single dose
schedule or a multiple dose schedule. The vaccine may be
administered in conjunction with other immunoregulatory
agents.

5. Example 1

Conjugate compounds comprising polysaccharides of the
Meningococci C group and heat shock proteins hspR70 and
hspR65 were constructed and tested for vaccine efficacy.

WO 93/17712 PCT/EP93/00516

5.1. Purification of polysaccharides of the Meningococci C
(MenC) group, and production of MenC oligosaccharides

The meningococcal polysaccharide of group C was purified
as described in Frasch C.E. "Advances in Biotechnological
Processes: Bacterial vaccines" (A. Mizrahi, ed.), vol. 13,
pp. 123-145, Wiley-Liss Inc., New York (1990). The
purified polysaccharide (10 mg/ml) was depolymerised by
hydrolysis in 0.01 M acetate buffer of pH 5, at 100°C for
8 hours. The resulting product was analysed by analytical
chromatography (on Sephadex G-50) and exhibited a Kd
(distribution coefficient) of 0.27.
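
The text does not say how the Kd was calculated. A minimal
sketch, assuming the conventional gel-filtration definition
Kd = (Ve - V0) / (Vt - V0), where Ve is the elution volume of
the sample, V0 the void volume and Vt the total column
volume, is given below. The three volumes are hypothetical
and were chosen only so that the result falls near the
reported value.

```python
def distribution_coefficient(elution_volume: float,
                             void_volume: float,
                             total_volume: float) -> float:
    """Kd = (Ve - V0) / (Vt - V0), the conventional gel-filtration definition."""
    if total_volume <= void_volume:
        raise ValueError("total volume must exceed the void volume")
    return (elution_volume - void_volume) / (total_volume - void_volume)


# Hypothetical column volumes in ml (not taken from the specification).
kd = distribution_coefficient(elution_volume=54.0,
                              void_volume=35.0,
                              total_volume=105.0)
print(f"Kd = {kd:.2f}")
```

A Kd of 0 corresponds to elution in the void volume and a Kd
of 1 to full access to the gel pores, so 0.27 indicates
material that is partially excluded from the gel.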

5.2. Introduction of a primary amine group into the
terminal groups of the oligosaccharide

0.5 M of ammonium chloride and 0.15 M of sodium
cyanoborohydride were added to the solution obtained from
the hydrolysis. The pH was raised to 7 and the resulting
solution was kept at 35°C for one week. The
oligosaccharide was then purified by chromatography on
Sephadex G-15, the void volume fractions containing chemical
activity in respect of the amine groups and carbohydrate
groups being collected while those containing monomeric
sugars and excess of reagents were discarded. The
resulting MenC oligosaccharide was characterised by
determining the amine groups, the sialic acid groups and
the O-acetyl group. The following molar ratios were
obtained: sialic acid/amine groups = 20, O-acetyl/sialic
acid = 0.84.
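
One way to read these ratios is sketched below. The sketch
assumes that each oligosaccharide chain carries exactly one
amine group, introduced at its terminal group; the text does
not state this, and the variable names are illustrative.
Under that assumption the sialic acid/amine ratio
approximates the average chain length.

```python
# Molar ratios reported for the MenC amino-oligosaccharide.
sialic_acid_per_amine = 20.0
o_acetyl_per_sialic_acid = 0.84

# Assumption (not stated in the text): one amine group per chain.
assumed_amines_per_chain = 1

# Average number of sialic acid residues per chain under that assumption.
average_chain_length = sialic_acid_per_amine * assumed_amines_per_chain

# Average number of O-acetyl groups carried by such a chain.
o_acetyl_per_chain = average_chain_length * o_acetyl_per_sialic_acid

print(f"average chain length: {average_chain_length:.0f} residues")
print(f"O-acetyl groups per chain: {o_acetyl_per_chain:.1f}")
```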

5.3. Preparation of heat shock proteins 

M. bovis BCG GroEL-type 65kDa hsp (hspR65) was expressed
from a recombinant E. coli K12 strain harbouring plasmid
pRIB1300 (Thole et al, Infect. Immun., 1985, 50, 800; Van
Eden et al, Nature, 1988, 331, 171) and purified as
described in Thole et al, Infect. Immun., 1987, 55, 1466.

Recombinant M. tuberculosis DnaK-type 70kDa hsp (hspR70)
was obtained and purified by ATP-agarose chromatography
(Mehlert et al, Mol. Microbiol., 1989, 2, 125).

5.4. Preparation of the glycoconjugates of MenC 
polysaccharide and hspR65 and hspR70 

The MenC amino-oligosaccharide was dissolved in
dimethylsulphoxide with 10% of H2O and then caused to react
with a 12-fold excess (relative to the amine groups) of the
N-hydroxysuccinimide ester of adipic acid [prepared
according to Hill et al. "FEBS LETT." 102:282 (1979)].
After purification by precipitation with dioxane (1-4-fold
amount) the activated oligosaccharide was dried in vacuo
and analysed for its content of N-hydroxysuccinimide
ester. HspR65 and hspR70 in an amount of 5 mg/ml of 0.1 M
phosphate buffer of pH 7 were then caused to react with a
300-fold molar excess of activated oligosaccharide. The
glycoconjugates respectively obtained were freed from the
unreacted oligosaccharides by chromatography, filtered and
stored at 4°C. The ratio of their content of sialic acid
to the percentage of sialic acid in the MenC
polysaccharide starting material represents the amount of
oligosaccharide which has been coupled. The protein
content of the preparation was confirmed by the method of
Lowry "J. Biol. Chem." 193:265 (1951). In particular, the
conjugate with hspR70 has a protein content of 310 µg/ml
and a saccharide content of 76 µg/ml whilst the conjugate
with hspR65 has a protein content of 180 µg/ml and a
saccharide content of 97 µg/ml.



5.5. Mice and immunisation

BALB/c (H-2d), C57BL/6 (H-2b) and CBA/J (H-2k) female
mice, 8-12 weeks old, were bred in our breeding
facilities. The starting couples were provided by Jackson
Laboratory, Bar Harbor, ME. BALB/c nu/nu athymic mice were
obtained from Iffa Credo, L'Arbresle, France.

On day 0, each mouse received intraperitoneally 10^6 CFU of
BCG (or PBS in the case of the control group), followed by
2 doses of conjugate on days 14 and 35 (in PBS).

Control groups received the MenC oligosaccharide alone, or
the MenC oligosaccharide - CRM197 conjugate vaccine
adsorbed on aluminium hydroxide (1 mg/dose, in 0.5 ml). In
each immunisation, the mice received 2 µg of MenC
oligosaccharide, which corresponded to 8.7 µg of MenC
oligosaccharide - CRM197 conjugate vaccine, 8.4 µg of the
MenC oligosaccharide - hspR70 conjugate or 3.7 µg of the
MenC oligosaccharide - hspR65 conjugate.
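
The relation between the 2 µg saccharide dose and the
conjugate amounts can be checked against the protein and
saccharide contents reported in section 5.4. The sketch
below assumes that the quoted conjugate amount is the mass
of carrier protein accompanying 2 µg of saccharide; this
reading is an assumption, and the names used are
illustrative. It reproduces the 3.7 µg figure for hspR65 and
gives about 8.2 µg for hspR70, close to but not identical
with the 8.4 µg stated in the text.

```python
# Protein and saccharide contents (µg/ml) reported in section 5.4.
CONJUGATES = {
    "hspR70": {"protein": 310.0, "saccharide": 76.0},
    "hspR65": {"protein": 180.0, "saccharide": 97.0},
}


def protein_per_dose(conjugate: str, saccharide_dose_ug: float = 2.0) -> float:
    """Carrier protein (µg) that accompanies a given saccharide dose (µg)."""
    content = CONJUGATES[conjugate]
    return saccharide_dose_ug * content["protein"] / content["saccharide"]


for name in CONJUGATES:
    print(f"{name}: {protein_per_dose(name):.1f} µg protein per 2 µg saccharide")
```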

5.6. Determination of the antibodies according to the
ELISA method

Each week, blood was taken from the retro-orbital plexus
of the mouse and the antibodies were titrated with ELISA.

For determination of IgG anti-MenC antibodies, flat-based
plates with 96 wells were covered with MenC polysaccharide
(5 µg/ml) (Nunc Immunoplate I, Nunc, Roskilde, Denmark) in
PBS, pH 7.4, by overnight incubation at 37°C. After
repeated washings with PBS containing 0.05% of Tween-20
(PBS-T) and incubation for one hour at 37°C with 200 µl of
PBS-T containing 5% of FCS, the wells were incubated
overnight at 4°C with 100 µl of mouse serum diluted in
PBS-T containing 5% of FCS. After repeated washings, the
plates were again incubated for 3 hours at 37°C with
100 µl of an appropriate dilution of an IgG anti-mouse
anti-serum conjugated to peroxidase.

The presence of specific antibodies was revealed by
addition of 2,2'-azino-bis-(3-ethylbenzothiazoline-sulphonic
acid) (ABTS; Kirkegaard and Perry Laboratories
Inc., Gaithersburg, MD) as the substrate. The results were
measured in terms of the absorbance at 414 nm. Samples of
serum with an absorbance of less than 0.2 at the first
dilution tested (1:50) were considered negative.
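
The negativity criterion stated above can be written as a
simple rule. The function name and the absorbance readings
in the sketch below are hypothetical; only the cutoff of 0.2
at the 1:50 dilution is taken from the text.

```python
NEGATIVE_CUTOFF = 0.2  # absorbance at 414 nm, first dilution tested (1:50)


def is_negative(absorbance_at_first_dilution: float,
                cutoff: float = NEGATIVE_CUTOFF) -> bool:
    """A serum is negative if its absorbance at 1:50 is below the cutoff."""
    return absorbance_at_first_dilution < cutoff


# Hypothetical readings at the 1:50 dilution, for illustration only.
readings = {"serum A": 0.05, "serum B": 0.20, "serum C": 1.35}
for sample, absorbance in readings.items():
    status = "negative" if is_negative(absorbance) else "positive"
    print(f"{sample}: A414 = {absorbance:.2f} -> {status}")
```

A reading exactly at 0.2 is not "less than 0.2" and is
therefore not classed as negative under this rule.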

5.7. Carrier effect of hspR65 and hspR70 conjugated to
oligosaccharide

HspR65 and hspR70-MenC oligosaccharide conjugates were
used to immunise BALB/c and C57BL/6 mice which had
previously been sensitised with BCG or were
non-sensitised.

As a control, groups of mice received MenC only or
CRM197-MenC conjugate vaccine in aluminium hydroxide.

Figure 1 shows that anti-MenC polysaccharide IgG
antibodies were produced after immunisation with hsp - MenC
conjugates in amounts comparable to, or greater than
(C57BL/6 in Figure 1A), those measured in the case of a
CRM197-MenC conjugate vaccine in aluminium hydroxide.
This effect was observed not only in the case of the
absence of adjuvant but also, in the case of the
hspR70-MenC conjugate, in the absence of sensitisation
with BCG (Figure 1B). These results make it possible to
state that immunisation using mycobacterial hsp in the
absence of adjuvants and of sensitisation with BCG can
indeed be achieved also with oligosaccharide antigens.

Accordingly it has been confirmed that the molecules of
hsp, in particular hspR70, exert a powerful carrier effect
also in mice not sensitised with BCG, and that hsp acts as
a potent carrier of molecules capable of inducing IgG
antibodies which are specific against the polysaccharide
transported, even in the absence of the adjuvants.

This potent carrier effect of the mycobacterial hsp,
exerted in the absence of adjuvants and of
pre-sensitisation, makes the conjugates described
particularly useful for the development of novel vaccines
against bacterial infections.

6. Example 2 



A novel H. pylori heat shock protein was identified and 
produced using recombinant DNA techniques. 

6.1. Materials and methods 

6.1.1. H. pylori strains and growth conditions

H. pylori strains used were: CCUG 17874, G39 and G33
(isolated from gastric biopsies in the hospital of
Grosseto, Italy), Pylo 2U+ and Pylo 2U- (provided by F.
Megraud, hospital Pellegrin, Bordeaux, France), BA96
(isolated from gastric biopsies at the University of Siena,
Italy). Strain Pylo 2U+ is noncytotoxic; strain Pylo 2U-
is noncytotoxic and urease-negative. All strains were
routinely grown on Columbia agar containing 0.2% of
cyclodextrin, 5 µg/ml of cefsulodin and 5 µg/ml of
amphotericin B under microaerophilic conditions for 5-6
days at 37°C. Cells were harvested and washed with PBS.
The pellets were resuspended in Laemmli sample buffer and
lysed by boiling.

Sera of patients affected by gastritis and ulcers
(provided by A. Ponzetto, hospital "Le Molinette", Torino,
Italy) and sera of patients with gastric carcinoma
(provided by F. Roviello, University of Siena, Italy) were
used.

6.1.2. Immunoscreening of the Library 

Five hundred thousand plaques of a λgt11 H. pylori DNA
expression library were mixed with 5 ml of a suspension of
E. coli strain Y1090 grown O/N in LB with 0.2% maltose and
10 mM MgSO4, and resuspended in 10 mM MgSO4 at 0.5 O.D.
After 10 minutes incubation at 37°C, 75 ml of melted
TopAgarose were poured into the bacterial/phage mix and the
whole was plated on BBL plates (50,000 plaques/plate).
After 3.5 hrs incubation of the plated library at 42°C,
nitrocellulose filters (Schleicher and Schuell, Dassel,
Germany), previously wetted with 10 mM IPTG, were set on
the plates and incubation was prolonged for 3.5 hrs at 37°C
and then O/N at 4°C. Lifted filters with lambda proteins
were rinsed in PBS, and saturated in 5% nonfat dried milk
dissolved in TBST (10 mM Tris pH 8, 100 mM NaCl, 5 mM MgCl2)
for 20 minutes. The first hybridization step was performed
with the sera of patients; to develop and visualize
positive plaques we used an alkaline phosphatase-conjugated
anti-human Ig antibody (Cappel, West Chester, PA) and the
NBT/BCIP kit (Promega, Madison, WI) in AP buffer (100 mM
Tris pH 9.5, 100 mM NaCl, 5 mM MgCl2) according to the
manufacturer's instructions.
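
The plating figures quoted above can be cross-checked with a
few lines of arithmetic. The sketch below is illustrative
only: the three input values come from the text, while the
even split of the top agarose across plates is an assumption
made for the example and is not stated in the protocol.

```python
# Plating arithmetic for the immunoscreening step.
# The three inputs are taken from the text; splitting the top
# agarose evenly across plates is an assumption of this sketch.
total_plaques = 500_000      # plaques screened
plaques_per_plate = 50_000   # plating density
top_agarose_ml = 75.0        # total melted top agarose

plates = total_plaques // plaques_per_plate
agarose_per_plate_ml = top_agarose_ml / plates

print(f"plates needed: {plates}")
print(f"top agarose per plate: {agarose_per_plate_ml} ml")
```

Under that assumption the library occupies ten plates with
7.5 ml of top agarose each.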

6.1.3. Recombinant DNA procedures

Reagents and restriction enzymes used were from Sigma (St.
Louis, MO) and Boehringer (Mannheim, Germany). Standard
techniques were used for molecular cloning,
single-stranded DNA purification, transformation in E. coli,
radioactive labelling of probes, colony screening of the
H. pylori DNA genomic library, Southern blot analysis,
PAGE and Western blot analysis.

6.1.4. DNA sequence analysis

The DNA fragments were subcloned in Bluescript SK+
(Stratagene, San Diego, CA). Single-stranded DNA
sequencing was performed by using [33P]α-dATP (New England
Nuclear, Boston, MA) and the Sequenase kit (U.S.
Biochemical Corp., Cleveland, OH) according to the
manufacturer's instructions. The sequence was determined in
both strands and each strand was sequenced, on average,
twice. Computer sequence analysis was performed using the
GCG package.

6.1.5. Recombinant proteins 

MS2 polymerase fusion proteins were produced using the
vector pEX34A, a derivative of pEX31. Insert Hp67 (from
nucleotide 445 to nucleotide 1402 in Fig. 5), and the
EcoRI linkers were cloned in frame into the EcoRI site of
the vector. In order to confirm the location of the stop
codon, the HpG3' HindIII fragment was cloned in frame into
the HindIII site of pEX34A. Recombinant plasmids were
transformed in E. coli K12 H1 Δtrp. In both cases, after
induction, a fusion protein of the expected molecular
weight was produced. In the case of the EcoRI/EcoRI
fragment, the fusion protein obtained after induction was
electroeluted and used to immunize rabbits using standard
protocols.
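
The coordinates given for insert Hp67 can be checked against
the reading frame of the gene. The following is a minimal
sketch, not part of the original methods: positions 445 and
1402 come from the text, whereas the start position 58 is an
assumption read from the printed nucleotide sequence (the
ATG that closes its first 60-base line). The sketch ignores
the EcoRI linkers and the vector frame.

```python
# Coordinates are 1-based positions in the printed nucleotide sequence.
ORF_START = 58      # first base of the starting ATG (assumption, read from the figure)
HP67_FIRST = 445    # first nucleotide of fragment Hp67 (from the text)
HP67_LAST = 1402    # last nucleotide of fragment Hp67 (from the text)


def fragment_length(first: int, last: int) -> int:
    """Length of an inclusive 1-based interval."""
    return last - first + 1


def codon_phase(position: int, orf_start: int) -> int:
    """Return 0 if position is the first base of a codon, else 1 or 2."""
    return (position - orf_start) % 3


hp67_length = fragment_length(HP67_FIRST, HP67_LAST)
hp67_phase = codon_phase(HP67_FIRST, ORF_START)
first_codon_number = (HP67_FIRST - ORF_START) // 3 + 1
whole_codons, leftover_bases = divmod(hp67_length, 3)

print(f"Hp67 length: {hp67_length} bp")
print(f"phase of position {HP67_FIRST}: {hp67_phase}")
print(f"Hp67 starts at codon {first_codon_number}")
print(f"whole codons: {whole_codons}, leftover bases: {leftover_bases}")
```

With these inputs the fragment is 958 bp long and begins on
the first base of a codon, which is consistent with an
in-frame fusion once the linkers are accounted for.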

6.2. Results 



6.2.1. Screening of an expression library and cloning of
H. pylori hsp

In order to find a serum suitable for the screening of an
H. pylori DNA expression library, sonicated extracts of H.
pylori strain CCUG 17874 were tested in Western blot
analysis against sera of patients affected by different
forms of gastritis. The pattern of antigen recognition by
different sera was variable, probably due to differences
in the individual immune response as well as to the
differences in the antigens expressed by the strains
involved in the infection.

Serum No. 19 was selected to screen a λgt11 H. pylori DNA
expression library to identify H. pylori specific
antigens, expressed in vivo during bacterial growth.

Following screening of the library with this serum, many
positive clones were isolated and characterized. The
nucleotide sequence of one of these, called Hp67, revealed
an open-reading frame of 958 base-pairs, coding for a
protein with high homology to the hsp60 family of
heat-shock proteins (Ellis, Nature 358:191-92 (1992)). In
order to obtain the entire coding region, we used fragment
Hp67 as a probe on Southern blot analysis of H. pylori DNA
digested with different restriction enzymes. Probe Hp67
recognized two HindIII bands of approximately 800 and 1000
base-pairs, respectively. A genomic H. pylori library of
HindIII-digested DNA was screened with probe Hp67 and two
positive clones (HpG5' and HpG3') of the expected
molecular weight were obtained. E. coli containing
plasmids pHp60G2 (approximately nucleotides 1 to 829) and
pHp60G5 (approximately nucleotides 824 to 1838) were
deposited with the American Type Culture Collection
(ATCC).
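
The approximate plasmid coordinates quoted above can be
compared with the two HindIII bands seen on the Southern
blot. The sketch below is only a consistency check on the
numbers in the text; the dictionary layout and variable
names are illustrative choices.

```python
# Approximate insert coordinates (1-based, inclusive) as quoted in the text.
clones = {
    "pHp60G2": (1, 829),
    "pHp60G5": (824, 1838),
}

# Size of each cloned HindIII fragment in base pairs.
sizes = {name: last - first + 1 for name, (first, last) in clones.items()}

# Bases shared by the two inserts around the internal HindIII site.
overlap = clones["pHp60G2"][1] - clones["pHp60G5"][0] + 1

for name, size in sizes.items():
    print(f"{name}: {size} bp")
print(f"shared bases at the internal site: {overlap}")
```

The computed sizes, 829 and 1015 bp, agree with bands of
approximately 800 and 1000 base-pairs, and the six shared
bases correspond to the length of one HindIII recognition
site (AAGCTT).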

6.3. Sequence analysis 

The nucleotide sequence analysis revealed an open-reading
frame of 1638 base-pairs, with a putative ribosome binding
site 6 base-pairs upstream of the starting ATG. Fig. 5 shows
the nucleotide and amino acid sequences of H. pylori hsp.
The putative ribosome-binding site and the internal HindIII
site are underlined. Cytosine in position 445 and guanine
in position 1402 are the first and last nucleotides,
respectively, in fragment Hp67. Thymine 1772 was
identified as the last putative nucleotide transcribed
using an algorithm for the localization of
factor-independent terminator regions. The open-reading
frame encoded a protein of 546 amino acids, with a predicted
molecular weight of 58.3 kDa and a predicted pI of 5.37.
The codon preference of this gene is in agreement with the
H. pylori codon usage.
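
The figures in this paragraph are mutually consistent, as
the short calculation below illustrates. It is a sketch
under one stated assumption: that the 1638 base-pairs do not
include the stop codon, which is what a product of 546
residues implies.

```python
ORF_LENGTH_BP = 1638       # open-reading frame length from the text
PREDICTED_MW_KDA = 58.3    # predicted molecular weight from the text

# Assumption: the 1638 bp exclude the stop codon, so every
# triplet encodes one residue.
residues, remainder = divmod(ORF_LENGTH_BP, 3)

# Average mass per residue implied by the predicted molecular weight.
mean_residue_mass_da = PREDICTED_MW_KDA * 1000 / residues

print(f"residues encoded: {residues} (leftover bases: {remainder})")
print(f"mean residue mass: {mean_residue_mass_da:.1f} Da")
```

The implied mean residue mass of roughly 107 Da is in the
range usually quoted for proteins, so the predicted
molecular weight is plausible for a 546-residue chain.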

The analysis of the hydrophilicity profiles revealed a
mostly hydrophilic protein, without a predicted leader
peptide or other transmembrane domains. The amino
terminal sequence showed 100% homology to the sequence of
30 amino acids determined by Dunn et al., Infect. Immun.
60:1946-51 (1992) on the purified protein and differed by
only one residue (Ser42 instead of Lys) from the sequence of
44 amino acids published by Evans et al., Infect. Immun.
60:2125-27 (1992). The N-terminal sequence of the mature
hsp protein did not contain the starting methionine,
indicating that this had been removed after translation.
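
To illustrate the relation between the coding sequence and
the N-terminus of the mature protein, the first 60 bases of
the open-reading frame (as printed in the sequence figure)
can be translated with the standard genetic code. This is a
minimal sketch: the helper name translate and the table
construction are illustrative and are not taken from the
GCG package used for the original analysis.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the open-reading frame, copied from the sequence figure.
orf_start = "ATGGCAAAAGAAATCAAATTTTCAGATAGTGCGAGAAACCTTTTATTTGAAGGCGTGAGG"

primary_product = translate(orf_start)
mature_n_terminus = primary_product[1:]  # starting methionine removed

print(primary_product)
print(mature_n_terminus)
```

The second string is the N-terminus expected for the mature
protein if, as stated above, the starting methionine is
removed after translation.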

6.4. Homology with hsp60 family 

The amino acid sequence analysis showed a very strong
homology with the family of heat-shock proteins hsp60,
whose members are present in every living organism. Based
on the degree of homology between hsp60 proteins of
different species, H. pylori hsp belongs to the subgroup
of hsp60 proteins of Gram negative bacteria; however, the
degree of homology to the other proteins of the hsp60
family is very high (at least 54% identity).

The homology of the H. pylori hsp with other heat shock
proteins is fully exemplified in Figure 3. The H. pylori
hsp or one or more functional immunostimulatory domains
thereof may be conjugated to an oligosaccharide or
polysaccharide using the procedures of Example 1 above to
produce conjugate compounds according to the invention.
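
A percentage identity such as the one quoted above is
computed from aligned sequences. The sketch below shows one
common convention (matches divided by the number of aligned,
non-gap columns); other conventions use a different
denominator, and the text does not say which was applied.
The second sequence in the example is hypothetical and
serves only to exercise the function.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where neither sequence has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"
    ]
    if not columns:
        return 0.0
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


# First ten residues of the H. pylori hsp against a made-up partner.
hp_hsp = "MAKEIKFSDS"
hypothetical_homologue = "MAKELKF-DA"  # hypothetical, for illustration only

identity = percent_identity(hp_hsp, hypothetical_homologue)
print(f"{identity:.1f}% identity")
```

Here seven of the nine gap-free columns match.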

6.5. Expression of recombinant proteins and production
of a polyclonal antiserum

The inserts of clone Hp67 and of clone HpG3' were
subcloned in the expression vector pEX34A in order to
express these open-reading frames fused to the
aminoterminus of the MS2 polymerase. The clones produced
recombinant proteins of the expected size that were
recognized by the human serum used for the initial
screening. The fused protein derived from clone Hp67 was
electroeluted and used to immunize rabbits in order to
obtain anti-hsp specific polyclonal antisera. The
antiserum obtained recognized both fusion proteins, and a
protein of 58 kDa in whole-cell extracts of several
strains of H. pylori tested, including a urease-negative
strain and noncytotoxic strains.

Hsp has been shown to be expressed by all the H. pylori
strains tested and its expression is not associated with
the presence of the urease or with the cytotoxicity. The
protein recognized by the anti-hsp antiserum was found in
the water soluble extracts of H. pylori and copurified
with the urease subunits. This suggests a weak
association of this protein with the outer bacterial
membrane. Thus, hsp can be described as urease-associated
and surface exposed. The cellular surface localization is
surprising as most of the hsp homologous proteins are
localized in the cytoplasm or in mitochondria and
plastids. The absence of a leader peptide in hsp suggests
either that it is exported to the membrane by a peculiar
export system, or that the protein is released from the
cytoplasm and is passively adsorbed by the bacterial
membrane after death of the bacterium.



7. Deposit of Biological Materials

The following materials were deposited on December 15,
1992 and January 22, 1993 by Biocine Sclavo, S.p.A., the
assignee of the present invention, with the American Type
Culture Collection (ATCC), 12301 Parklawn Drive,
Rockville, Maryland, phone (301) 231-5519, under the terms
of the Budapest Treaty on the International Recognition of
the Deposit of Microorganisms for Purposes of Patent
Procedure.

ATCC No. 69155 E. coli TG1 containing the plasmid pHp60G2
ATCC No. 69156 E. coli TG1 containing the plasmid pHp60G5

These deposits are provided as a convenience to those of
skill in the art, and are not an admission that a deposit
is required under 35 U.S.C. §112 or any equivalent
provision in any one of the designated states herein. The
nucleic acid sequences of these deposits, as well as the
amino acid sequences of the polypeptides encoded thereby,
are incorporated herein by reference and should be
referred to in the event of any error in the sequences
described herein as compared with the sequences of the
deposits. A licence may be required to make, use, or sell
the deposited materials, and no such licence is granted
hereby.

CLAIMS

1. A conjugate compound comprising at least one heat
shock protein or portion thereof including at least one
immunostimulatory domain and at least one oligosaccharide
or polysaccharide.

2. A conjugate compound according to claim 1
comprising oligosaccharides of the Meningococci C (MenC)
group and a heat shock protein.

3. A conjugate compound according to claim 1 or 2
wherein the heat shock protein is selected from M. bovis
BCG GroEL-type 65kDa hsp (hspR65), recombinant
M. tuberculosis DnaK-type 70kDa hsp (hspR70) and a heat
shock protein from H. pylori.

4. A process for producing conjugate compounds
according to any one of claims 1 to 3 which comprises
covalently bonding a heat shock protein or portion thereof
including at least one immunostimulatory domain to at
least one oligosaccharide or polysaccharide.

5. A conjugate compound according to any one of claims
1 to 3 for use as a pharmaceutical.

6. Use of a conjugate compound according to any one of
claims 1 to 3 in the manufacture of a medicament for
vaccination against bacterial infection.

7. A method of vaccination comprising administering an
immunologically effective amount of a conjugate compound
according to any one of claims 1 to 3.

8. A vaccine or therapeutic composition comprising one
or more conjugate compounds according to any one of claims
1 to 3 and a pharmaceutically acceptable carrier.

9. A method for the preparation of a vaccine
comprising bringing one or more conjugate compounds
according to any one of claims 1 to 3 into association
with a pharmaceutically acceptable carrier.



WO 93/17712 PCT/EP93/00516

1/7

[FIG. 1: panels A) and B), each with one plot for BALB/c and
one for C57BL/6 mice. Series: CRM197-MenC, MenC, hspR65,
hspR70. X-axis: days post-BCG priming (ticks 0 to 49).
Y-axis ticks 1.5 to 5; the axis label is not recoverable
from the source.]



[Sheets 2/7 to 4/7: figure content not recoverable from the
source. The surviving residue is consistent with an amino
acid sequence alignment in five numbered rows.]



5/7

10 30 50
AAGCTTGCTGTCATGATCACAAAAAACACTAAAAAACATTATTATTAAGGATACAAAATG
M

70 90 110
GCAAAAGAAATCAAATTTTCAGATAGTGCGAGAAACCTTTTATTTGAAGGCGTGAGGCAA
AKEIKFSDSARNLLFEGVRQ

130 150 170
CTCCATGACGCTGTCAAAGTAACCATGGGGCCAAGAGGCAGGAATGTATTGATCCAAAAA
LHDAVKVTMGPRGRNVLIQK

190 210 230
AGCTATGGCGCTCCAAGCATCACCAAAGACGGCGTGAGCGTGGCTAAAGAGATTGAATTA
SYGAPSITKDGVSVAKEIEL

250 270 290
AGTTGCCCAGTAGCTAACATGGGCGCTCAACTCGTTAAAGAAGTAGCGAGCAAAACCGCT
SCPVANMGAQLVKEVASKTA

310 330 350
GATGCTGCCGGCGATGGCACGACCACAGCGACCGTGCTAGCTTATAGCATTTTTAAAGAA
DAAGDGTTTATVLAYSIFKE

370 390 410
GGTTTGAGGAATATCACGGCTGGGGCTAACCCTATTGAAGTGAAACGAGGCATGGATAAA
GLRNITAGANPIEVKRGMDK

430 450 470
GCTGCTGAAGCGATCATTAATGAGCTTAAAAAAGCGAGCAAAAAAGTAGGCGGTAAAGAA
AAEAIINELKKASKKVGGKE

490 510 530
GAAATCACCCAAGTGGCGACCATTTCTGCAAACTCCGATCACAATATCGGGAAACTCATC
EITQVATISANSDHNIGKLI

550 570 590
GCTGACGCTATGGAAAAAGTGGGTAAAGACGGCGTGATCACCGTTGAGGAAGCTAAGGGC
ADAMEKVGKDGVITVEEAKG

610 630 650
ATTGAAGATGAATTGGATGTCGTAGAAGGCATGCAATTTGATAGAGGCTACCTCTCCCCT
IEDELDVVEGMQFDRGYLSP

FIG. 3(a)

6/7



670 690 710
TATTTTGTAACGAACGCTGAGAAAATGACCGCTCAATTGGATAATGCTTACATCCTTTTA
YFVTNAEKMTAQLDNAYILL

730 750 770
ACGGATAAAAAAATCTCTAGCATGAAAGACATTCTCCCGCTACTAGAAAAAACCATGAAA
TDKKISSMKDILPLLEKTMK

790 810 HindIII
GAGGGCAAACCGCTTTTAATCATCGCTGAAGACATTGAGGGCGAAGCTTTAACGACTCTA
EGKPLLIIAEDIEGEALTTL

850 870 890
GTGGTGAATAAATTAAGAGGCGTGTTGAATATCGCAGCGGTTAAAGCTCCAGGCTTTGGG
VVNKLRGVLNIAAVKAPGFG

910 930 950
GACAGAAGAAAAGAAATGCTCAAAGACATCGCTATTTTAACCGGCGGTCAAGTCATTAGC
DRRKEMLKDIAILTGGQVIS

970 990 1010
GAAGAATTGGGCTTGAGTCTAGAAAACGCTGAAGTGGAGTTTTTAGGCAAAGCTGGAAGG
EELGLSLENAEVEFLGKAGR

1030 1050 1070
ATTGTGATTGACAAAGACAACACCACGATCGTAGATGGCAAAGGCCATAGCGATGATGTT
IVIDKDNTTIVDGKGHSDDV

1090 1110 1130
AAAGACAGAGTCGCGCAGATCAAAACCCAAATTGCAAGTACGACAAGCGATTATGACAAA
KDRVAQIKTQIASTTSDYDK

1150 1170 1190
GAAAAATTGCAAGAAAGATTGGCTAAACTCTCTGGCGGTGTGGCTGTGATTAAAGTGGGC
EKLQERLAKLSGGVAVIKVG

1210 1230 1250
GCTGCGAGTGAAGTGGAAATGAAAGAGAAAAAAGACCGGGTGGATGACGCGTTGAGCGCG
AASEVEMKEKKDRVDDALSA

1270 1290 1310
ACTAAAGCGGCGGTTGAAGAAGGCATTGTGATTGGTGGCGGTGCGGCTCTCATTCGCGCG
TKAAVEEGIVIGGGAALIRA

FIG. 3(b)

7/7



1330 1350 1370
GCTCAAAAAGTGCATTTGAATTTGCACGATGATGAAAAAGTGGGCTATGAAATCATCATG
AQKVHLNLHDDEKVGYEIIM

1390 1410 1430
CGCGCCATTAAAGCCCCATTAGCTCAAATCGCTATCAACGCTGGTTATGATGGCGGTGTG
RAIKAPLAQIAINAGYDGGV

1450 1470 1490
GTCGTGAATGAAGTAGAAAAACACGAAGGGCATTTTGGTTTTAACGCTAGCAATGGCAAG
VVNEVEKHEGHFGFNASNGK

1510 1530 1550
TATGTGGATATGTTTAAAGAAGGCATTATTGACCCCTTAAAAGTAGAAAGGATCGCTCTA
YVDMFKEGIIDPLKVERIAL

1570 1590 1610
CAAAATGCGGTTTCGGTTTCAAGCCTGCTTTTAACCACAGAAGCCACCGTGCATGAAATC
QNAVSVSSLLLTTEATVHEI

1630 1650 1670
AAAGAAGAAAAAGCGACTCCGGCAATGCCTGATATGGGTGGCATGGGCGGTATGGGAGGC
KEEKATPAMPDMGGMGGMGG

1690 1710 1730
ATGGGCGGCATGATGTAAGCCCGCTTGCTTTTTAGTATAATCTGCTTTTAAAATCCCTTC
MGGMM*

1750 1770 1790
TCTAAATCCCCCCCTTTCTAAAATCTCTTTTTTGGGGGGGTGCTTTGATAAAACCGCTCG

1810 1830
CTTGTAAAAACATGCAACAAAAAATCTCTGTTAAGCTT

FIG. 3(c)